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Preface 


Brilliant diamond and carbon black (graphite) are both made of carbon (C). Diamond is an insulator 
while graphite is a good conductor. This difference arises from the lattice structure. Graphite is a 
layered material made up of sheets, each forming a two-dimensional (2D) honeycomb lattice, called 
graphene. The electrical conduction mainly occurs through graphene sheets. Carbon nanotubes were 
discovered by Iijima^ in 1991. The nanotubes ranged from 4 to 30 nm in diameter and were microns 
(pm) in length, had scroll-type structures, and were called Multiwalled Nanotubes (MWNTs) in the 
literature. Single-Wall Nanotubes (SWNTs) have a size of about 1 nm in diameter and microns in 
length. This is a simple two-dimensional material. It is theorists’ favorite system The electrical 
transport properties along the tube present, however, many puzzles, as is explained below. Carbon 
nanotubes are very strong and light. In fact, carbon fibers are used to make tennis rackets. Today’s 
semiconductor technology is based on silicon (Si) devices. It is said that carbon chips, which are 
stronger and lighter, may take the place of silicon chips in the future. It is, then, very important to 
understand the electrical transport properties of carbon nanotubes. The present book has as its 
principal topics electrical transport in graphene and carbon nanotubes. 

The conductivity a in individual carbon nanotubes varies, depending on the tube radius and the 
pitch of the sample. In many cases the resistance decreases with increasing temperature while the 
resistance increases in the normal metal. Electrical conduction in SWNTs is either semiconducting or 
metallic, depending on whether each pitch of the helical line connecting the nearest-neighbor C- 
hexagon centers contains an integral number of hexagons or not. The second alternative occurs more 
often since the pitch is not controlled in the fabrication process. The room-temperature conductivity 
in metallic SWNTs is higher by two or more orders of magnitude than in semiconducting SWNTs. 
Currents in metallic SWNTs do not obey Ohm’s law linearity between current and voltage. Scanned 
probe microscopy shows that the voltage does not drop along the tube length, implying a 
superconducting state. The prevailing theory states that electrons run through the one-dimensional 
(ID) tube ballistically. But this interpretation is not the complete story. The reason why the ballistic 
electrons are not scattered by impurities and phonons is unexplained. We present a new interpretation 
in terms of the model in which superconducting Bose-condensed Cooper pairs (bosons) run as a 
supercurrent. In our text we start with the honeycomb lattice, construct the Fermi surface, and develop 
Bloch electron dynamics based on the rectangular unit cell model. We then use kinetic theory to treat 
the normal electrical transport with the assumption of “electrons,” “holes,” and Cooper pairs as 
carriers. 

To treat the superconducting state, we assume that the phonon-exchange attraction generates Cooper 
pairs (pairons). We start with a Bardeen-Cooper-Schrieffer (BCS)-like Hamiltonian, derive a linear 
dispersion relation for the moving pairons, and obtain a formula for the Bose-Einstein Condensation 
(BEC) temperature 

fc B T c = 1.24* v P it 1 ' 1 , (2D) 

where n is the pairon density and v F the Fermi speed. The superconducting temperature T c given here, 
is distinct from the famous BCS formula for the critical temperature: 3.53k B T c = 2A 0 , where A 0 is the 
zero-temperature electron energy gap in the weak coupling limit. The critical temperature T c for 


metallic SWNTs is higher than 150 K while the T c is much lower for semiconducting SWNTs. 

MWNTs have open-ended circumferences and the outermost walls with greatest radii, contribute 
most to the conduction. The conduction is metallic (with no activation energy factor) and shows no 
pitch dependence. 

In 2007 Novoselov et al.^* discovered the room-temperature Quantum Hall Effect (QHE) in 
graphene. This was a historic event. The QHE in the GaAs/AlGaAs heterojunction is observed 
around 1 K and below. The original authors interpreted the phenomenon in terms of a Dirac fermion 
moving with a linear dispersion relation. But the reason why Dirac fermions are not scattered by 
phonons, which must exist at 300 K, is unexplained. We present an alternative explanation in terms of 
the composite bosons traditionally used in QHE theory. The most important advantage of our bosonic 
theory over the Dirac fermion theory is that our theory can explain why the plateau in the Hall 
conductivity (a xy ) is generated where the zero resistivity (p^ = 0) is observed. 

This book has been written for first-year graduate students in physics, chemistry, electrical 
engineering, and material sciences. Dynamics, quantum mechanics, electromagnetism, and solid state 
physics at the senior undergraduate level are prerequisites. Second quantization may or may not be 
covered in the first-year quantum course. But second quantization is indispensable in dealing with 
phonon-exchange, superconductivity, and QHE. It is fully reviewed in Appendix A. 1. The book is 
written in a self-contained manner. Thus, nonphysics majors who want to learn the microscopic 
theory step-by-step with no particular hurry may find it useful as a self-study reference. 

Many fresh, and some provocative, views are presented. Experimental and theoretical researchers 
in the field are also invited to examine the text. The book is based on the materials taught by Fujita for 
several courses in quantum theory of solids and quantum statistical mechanics at the University at 
Buffalo. Some of the book’s topics have also been taught by Suzuki in the advanced course in 
condensed matter physics at the Tokyo University of Science. The book covers only electrical 
transport properties. For other physical properties the reader is referred to the excellent book 
Physical Properties of Carbon Nanotubes, by R. Saito, G. Dresselhaus and M.S. Dresselhaus 
(Imperial College Press, London 1998). 

The authors thank the following individuals for valuable criticisms, discussions and readings: 
Professor M. de Llano, Universidad Nacional Autonoma de Mexico; Professor Sambandamurthy 
Ganapathy, University at Buffalo, Mr. Masashi Tanabe, Tokyo University of Science and Mr. Yoichi 
Takato, University at Buffalo. We thank Sachiko, Keiko, Michio, Isao, Yoshiko, Eriko, George 
Redden and Kurt Borchardt for their encouragement, reading and editing of the text. 


Shigeji Fujita 
Akira Suzuki 


Buffalo, New York, USA 
Tokyo, Japan 
December, 2012 

1} Iijima, S. (1991) Nature (London), 354 , 56. 

2) Novoselov, K.S. et al. (2007) Science, 315 , 1379. 


Physical Constants, Units, Mathematical Signs and 

Symbols 

Useful Physical Constants 


Quantity 

Symbol 

Value 

Absolute zero temperature 


OK = -273.16°C 

Avogadro’s number 

N a 

6.02 x 10 23 mol -1 

Bohr magneton 

Ob = eh/(2m e ) 

9.27 x 10“ 24 J T _1 

Bohr radius 

a B = 47i8 0 h/(m e e 2 ) 5.29 x 10 11 m 

Boltzmann’s constant 

k B = R/N a 

1.38 x 10 -23 JK" 1 

Coulomb’s constant 

ko = 1/(47180) 

8.988 x 10 9 N m C -2 

Dirac’s constant (Planck’s constant/(27i)) 

h = h/(27i) 

1.05 x 10“ 34 J s 

Electron charge (magnitude) 

e 

1.60 x 10 19 C 

Electron rest mass 

IBg 

9.11 x 10“ 31 kg 

Gas constant 

R = N A k B Z 

8.314 JK -1 mol -1 

Gravitational constant 

G 

6.674 x 10 -11 N m 2 kg -2 

Gravitational acceleration 

g 

9.807 ms -2 

Magnetic flux quantum 

<J> 0 = h/(2e) 

2.068 x 10 -15 Wb 

Mechanical equivalent of heat 


4.184 J cal -1 

Molar volume (gas at STP) 


2.24 x 10 4 cm 3 = 22.4 L 

Permeability of vacuum 

ko 

471 x 10 -7 H m -1 

Permittivity of vacuum 

s 0 

8.85 x 10“ 12 F rrf 1 

Planck’s constant 

h 

6.63 x 10“ 34 J s 

Proton mass 

mp 

1.67 x 10' 27 kg 

Quantum Hall conductance 

e 2 /h 

3.874 x 10“ 6 S 

Quantum Hail resistance 

R h = h/e 2 

25 812.81 G 

Speed of light 

c 

3.00 x 10 8 m s _1 


Subsidiary Units 


newton 

joule 

coulomb 

hertz 

pascal 

bar 


1 N = 1 kg m s " 
1 J = lNm 
1 C = 1 As 
1 Hz = 1 s _1 
lPa=lNm' 2 
1 bar = 10 5 Pa 






Prefixes Denoting Multiples and Submultiples 


10 3 

kilo (k) 

10 6 

mega (M) 

10 9 

giga (G) 

10 12 

tera (T) 

10 15 

peta (P) 

10“ 3 

milli (m) 

1(T 6 

micro ( li ) 

10“ 9 

nano (n) 

10 -12 

pico (p) 

10“ 15 

femto (f) 


Mathematical Signs 


N 

set of natural numbers 

Z 

set of integers 

Q 

set of rational numbers 

9* 

set of real numbers 

C 

set of complex numbers 

Vx 

for all x 

3 x 

existence of x 

h»- 

maps to 


therefore 


because 

= 

equals 


approximately equals 

* 

not equal to 

= 

identical to, defined as 

> 

greater than 

^> 

much greater than 

< 

smaller (or less) than 

« 

much smaller than 

> 

greater than or equal to 

< 

smaller (or less) than or equal to 

OC 

proportional to 


represented by, of the order 

C?(x) 

order of x 


{Xh* 

In 


A* 

d* 

z* = x-iy 


cd 

p-i 

<*(*) 

*U = 


jl if i = j 
|0 if 


the average value of x 

logarithm of base e (natural logarithm) 

increment in x 

infinitesimal increment in x 

complex conjugate of complex number’ z = x +iy [sc, y e 
R, i = imaginary unit = V—T) 

Hermitian conjugate of operator a 
Hermitian conjugate of matrix a 
inverse of P 
Diracs delta function 

Krone deer's delta 


©(*) = 


sgnx = 



x — ds/dt 
grad^ = V<p 


if* > 0 
if* <0 
if* > 0 
if* < 0 


div.4 = V ■ .A 


cur 1.4 = rot4 = V x 4 


V 

A - 


V 2 


Heaviside’s step function 


sign of * 

time derivative 
gradient of <p 
divergence of 4 
curl (or rotation) of 4 
Nabla (or del) operator 
Laplacian operator 


List of Symbols 

The following list is not intended to be exhaustive. It includes symbols of frequent occurrence or 
special importance in this book. 

A angstrom (= KT 8 cm = 10 _10 m) 

A vector potential 

a 0 lattice constant 

a 1? a 2 nonorthogonal base vectors 


B 

Cy 

C 

C Y 

C 

eOO 

x>(g>) 

u(p)» v(k) 

E 

E 

Ep 

E 

Eh 

e 

F 


magnetic field (magnetic flux density) 
specific heat at constant volume 
heat capacity per particle 
heat capacity per unit volume 
speed of light 

density of states in energy space 

density of states in angular frequency 

density of states in momentum space 

total energy 

internal energy 

Fermi energy 

electric field vector 

electric field vector due to the Hall voltage 
electronic charge (absolute value) 
orthogonal unit vectors 
Helmholtz free energy 


f 

% 

f F 

f 0 

G 

g 

n 

H a 

H c 

h 

h 

h 

I 

is V^l 

i, j, k 
J 

j 

j 

K 

k 

ko 

k B 

£ 

L 

£ 

1 

M 

* 

M 

M 

m 

* 

m 

* 

m 

N 

M 

N l 

n 

n c 

n P 

P 

P 

P 


one-body distribution function 
Bose distribution function 
Fermi distribution function 
Planck distribution function 
Gibbs free energy 
g-factor 
Hamiltonian 

applied magnetic field vector 

critical magnetic field (magnitude) 

Planck’s constant 

single-particle Hamiltonian 

Planck’s constant divided by 2n 

magnetization 

imaginary unit 

Cartesian unit vectors 

total current 

single-particle current 

current density 

thermal conductivity 

wave vector (k-vector) 

Coulomb’s constant 
Boltzmann constant 
Lagrangian function 
normalization length 
mean free path 
angular momentum 
molecular mass 

magnetotransport mass 
(symmetric) mass tensor 
electron mass 

cyclotron mass 

effective mass 

number of particles 

number operator 

Landau level 

particle number density 

number density of the dressed electrons 

number density of pairons 

pressure 

total momentum 

momentum vector 


p 

Q 

q 

q 

R 

R 

R 

Rh 

r 

r 

S 

S 

T 

T 0 

T c 

T F 

r 

TR 

Tr 

tr 

V, v 

V H 

V 

v 

v 

v thermal 

v d 

v d(= l v dl) 

Vp 

Vp(= |v F |) 

W 

w 

Z 

a = -e/(2m) 
e a = z 

P s l/(k B T) 

X 

£ 

£p 

£ g 

£ P 

0(x) 


momentum (magnitude) 
quantity of heat 
heat (energy) current 
charge 
resistance 

Bravais lattice vector 

position vector of the center of mass 

Hall coefficient 

radial coordinate 

position vector 

entropy 

Seebeck coefficient 
absolute temperature 
transition temperature 
critical temperature 
Fermi temperature 
kinetic energy 
grand ensemble trace 
many-particle trace 
one-particle trace 
volume 
Hall voltage 
potential energy 
speed (magnitude of v) 
velocity 

thermal velocity 
drift velocity 
drift speed 
Fermi velocity 
Fermi speed 
work 

wrapping vector 
partition function 

magnetogyric (magnetomechanical) ratio 
fugacity 

reciprocal temperature 
magnetic susceptibility 
single-particle energy 
Fermi energy 
energy gap 
pairon energy 
step function 


0 

X 

X 

),(= e^ 1 ) 

K 

K 

V 

V 


4 

p 

p 

p 

p 

P(B) 

PH 

o 

o 

°H 

a x’ °y> c z 

T 

9 

* 

* 

<D 

Oq 

¥ 

9 

da = sin 0d0d4> 
(0 = 271V 

®C 

®C 

©D 

(I 

I) 

(hkl), [hkl], {hkl} 


polar angle 

wavelength 
penetration depth 

fugacity 
curvature 
quantum state 
chemical potential 
magnetic moment 
Bohr magneton 
frequency 

Landau level occupation ratio (filling factor) 

grand partition function 

dynamical variable 

coherence length 

mass density 

density operator 

many-particle distribution function 

resistivity 

magnetoresistivity 

Hall resistivity 

total cross section 

electrical conductivity 

Hall conductivity 

Pauli spin matrices 

relaxation time 

collision time, average time between collision 

duration of collision 

distribution function 

azimuthal angle 

scalar potential 

magnetic flux 

flux quantum 

quasiwavefunction for many condensed bosons 

wavefunction for a quantum particle 

element of solid angle 

angular frequency 

cyclotron frequency 

rate of collision (collision frequency) 

Debye frequency 
bra vector 
ket vector 

crystallographic notation 


[,] 

commutator brackets 

{,} 

anticommutator brackets 

{,} 

Poisson brackets 

Units 



In much of the literature quoted, the unit of magnetic field B is the gauss. Electric fields are frequently 
expressed in V cm -1 and resistivity in Q cm 

I tesla (T) = 10 4 gauss (G, (Gs)) 1 Q rn = 10 2 Q cm 

The Planck constant h over 2n, h = h/(27t), is used in dealing with an electron. The original Planck 
constant h is used in dealing with a photon. 

Crystallographic Notation 

This is mainly used to denote a direction, or the orientation of a plane, in a cubic metal. A plane (hkl) 
intersects the orthogonal Cartesian axes, coinciding with the cube edges, at a/h, a/k, and a/1 from the 
origin, a being a constant, usually the length of a side of the unit cell. The direction of a line is 
denoted by [hkl], the direction cosines with respect to the Cartesian axes being h/N, k/N, and 1/N, 
where N 2 = h 2 + k 2 + l 2 . The indices may be separated by commas to avoid ambiguity. Only 
occasionally will the notation be used precisely; thus, [100] or [001] usually means any cube axis and 
[HI], any diagonal. 

BandH 

When an electron is described in quantum mechanics, its interaction with a magnetic field is 
determined by B rather than H; that is, if the permeability p is not unity, the electron motion is 
determined by pH. It is preferable to forget H altogether and use B to define all field strengths. The B 
is connected with a vector potential A such that B = V x A. The magnetic field B is effectively the 
same inside and outside the metal sample. 


List of Abbreviations 


ID 

one dimensional 

2D 

two dimensional 

3D 

three dimensional 

ARPES 

angle-resolved photoemission spectroscopy 

bcc 

body-centered cubic 

BCS 

Bardeen-Cooper-Schrieffer 

BEC 

Bose-Einstein condensation 

c- 

composite- 

c.c. 

complex conjugate 

C- 

carbon- 

CM 

center of mass 

CNT 

carbon nanotube 



cub, cub 

cubic 

dHvA 

de Haas-van Alphen 

dia 

diamond 

DOS 

density of states 

DP 

Dirac picture 

“electron” 

see p. 7 

EOB 

Ehrenfest-Oppenheimer-Bethe 

f(c-) 

fundamental (composite-) 

fee 

face-centered cubic 

h.c. 

Hermitian conjugate 

hep 

hexagonal closed packed 

“hole” 

see p. 7 

hex 

hexagonal 

HP 

Heisenberg picture 

HRC 

high-resistance contacts 

HTSC 

high-temperature superconductivity 

KP 

Kronig-Penney 

Ihs 

left-hand side 

LL 

Landau level 

LRC 

low-resistance contacts 

mcl, mcl 

monoclinic 

MIT 

metal-insulator transition 

MR 

magnetore sistanc e 

MWNT 

multiwalled (carbon) nanotube 

NFEM 

nearly free electron model 

NT 

nanotube 

ore 

orthorhombic 

QH 

quantum Hall 

QHE 

quantum Hall effect 

rhs 

right-hand side 

rhl 

rhombohedral 

sc 

simple cubic 

SdH 

Shubnikov-de Haas 

SP 

Schrodinger picture 

sq 

square 

SQUID 

superconducting quantum interference device 

SWNT 

single-wall (carbon) nanotube 

tel 

triclinic 

tet 

tetragonal 

vrh 

variable range hopping 

WS 

Wigner-Seitz 

ZBA 

zero-bias anomaly 



Chapter 1 


Introduction 


1.1 Carbon Nanotubes 

Graphite and diamond are both made of carbons. They have different lattice structures and different 
properties. Diamond is brilliant and it is an insulator while graphite is black and it is a good 
conductor. 

In 1991 Iijima [1] discovered carbon nanotubes (CNTs) in the soot created in an electric discharge 
between two carbon electrodes. These nanotubes ranging from 4 to 30 nm in diameter were found to 
have helical multiwalled structures as shown in Figures 1.1 and L2 after electron diffraction 
analysis. The tube length is about 1 pm 

Figure 1.1 Schematic diagram showing (a) a helical arrangement of graphitic carbons and (b) its 
unrolled plane. The helical line is indicated by the heavy line passing through the centers of the 
hexagons. 


p 


(a) (b) 

Figure 1.2 A multiwalled nanotube. The tube diameter ranges from 4 to 30 nm and its length is about 
1 pm (Original figure, Iijima [1]) 



The scroll-type tube shown in Figure 1.2 is called a multiwalled carbon nanotube (MWNT). A 
single-wall nanotube (SWNT) was fabricated by Iijima and Ichihashi [2] and by Bethune et al. [3] in 
1993. Their structures are shown in Figure 1.3 . 



















Figure 1.3 SWNTs with different chiralities and possible caps at each end: (a) shows a so-called 
armchair carbon nanotube (CNT), (b) a zigzag CNT, and (c) a general chiral CNT. One can see from 
the figure that the orientation of the C-hexagon in the honeycomb lattice relative to the tube axis can 
be taken arbitrarily. The terms “armchair” and “zigzag” refer to the arrangement of C-hexagons 
around the circumference. 

(From [4, 5]). 



The tube is about 1 nm in diameter and a few micrometers in length. The tube ends are closed as 
shown. Because of their small radius and length-to-diameter ratio > 10 4 , they provide an important 
system for studying two-dimensional (2D) physics, both theoretically and experimentally. Unrolled 
carbon sheets are called graphene.^ They have a honeycomb lattice structure as shown in Figure 
1.1b . 

A SWNT can be constructed from a slice of graphene (that is a single planar layer of the 
honeycomb lattice of graphite) rolled into a circular cylinder. 

Carbon nanotubes are light since they are entirely made of the light element carbon (C). They are 
strong and have excellent elasticity and flexibility. In fact, carbon fibers are used to make tennis 
rackets, for example. Their main advantages in this regard are their high chemical stability as well as 
their strong mechanical properties. 

Today’s semiconductor technology is based mainly on silicon (Si). It is said that carbon-based 
devices are expected to be as important or even more important in the future. To achieve this purpose 
we must know the electrical transport properties of CNTs, which are very puzzling, as is explained 
below. The principal topics in this book are the remarkable electrical transport properties in CNTs 
and graphene on which we will mainly focus in the text. 

The conductivity a in individual CNTs varies, depending on the tube radius and the pitch of the 
sample. In many cases the resistance decreases with increasing temperature. In contrast the resistance 
increases in the normal metal such as copper (Cu). The electrical conduction properties in SWNTs 
separates samples into two classes: semiconducting or metallic. The room-temperature conductivities 
are higher for the latter class by two or more orders of magnitude. Saito et al. [6] proposed a model 
based on the different arrangements of C-hexagons around the circumference, called the chiralities. 









Figure 1.3a-c show an armchair, zigzag, and a general chiral CNT, respectively. After statistical 
analysis, they concluded that semiconducting SWNTs should be generated three times more often than 
metallic SWNTs. Moriyama etal. [7] fabricated 12 SWNT devices from one chip, and observed that 
two of the SWNT samples were semiconducting and the other ten were metallic, a clear discrepancy 
between theory and experiment. We propose a new classification. The electrical conduction in 
SWNTs is either semiconducting or metallic depending on whether each pitch of the helical line 
connecting the nearest-neighbor C-hexagon contains an integral number of hexagons or not. The 
second alternative (metallic SWNT) occurs more often since the helical angle between the helical 
line and the tube axis is not controlled in the fabrication process. In the former case the system 
(semiconducting SWNT) is periodic along the tube length and the “holes” (and not “electrons”) can 
travel along the wall. Here and in the text “electrons” (“holes”), by definition, are quasielectrons 
which are excited above (below) the Fermi energy and which circulate clockwise (counterclockwise) 
when viewed from the tip of the external magnetic field vector. “Electrons” (“holes”) are generated in 
the negative (positive) side of the Fermi surface which contains the negative (positive) normal vector, 
with the convention that the positive normal points in the energy-increasing direction. In the Wigner- 
Seitz (WS) cell model [7] the primitive cell for the honeycomb lattice is a rhombus. This model is 
suited to the study of the ground state of graphene. For the development of the electron dynamics it is 
necessary to choose a rectangular unit cell which allows one to define the effective masses associated 
with the motion of “electrons” and “holes” in the lattice. 

Silicon (Si) (germanium (Ge)) forms a diamond lattice which is obtained from the zinc sulfide 
(ZnS) lattice by disregarding the species. The electron dynamics of Si are usually discussed in terms 
of cubic lattice languages. Graphene and graphite have hexagonal lattice structures. Silicon and 
carbon are both quadrivalent materials but because of their lattice structures, they have quite different 
physical properties. 

1.2 Theoretical Background 

1.2.1 Metals and Conduction Electrons 

A metal is a conducting crystal in which electrical current can flow with little resistance. This 
electrical current is generated by moving electrons. The electron has mass m and charge —e, which is 
negative by convention. Their numerical values are m= 9.1 x 10 -28 g and e = 4.8 x 10 -10 esu = 1.6 x 
10 -19 C. The electron mass is about 1837 times smaller than the least-massive (hydrogen) atom This 
makes the electron extremely mobile. It also makes the electron’s quantum nature more pronounced. 
The electrons participating in the transport of charge are called conduction electrons. The conduction 
electrons would have orbited in the outermost shells surrounding the atomic nuclei if the nuclei were 
separated from each other. Core electrons which are more tightly bound with the nuclei form part of 
the metallic ions. In a pure crystalline metal, these metallic ions form a relatively immobile array of 
regular spacing, called a lattice. Thus, a metal can be pictured as a system of two components: mobil 
electrons and relatively immobile lattice ions. 



1.2.2 Quantum Mechanics 

Electrons move following the quantum laws of motion. A thorough understanding of quantum theory is 
essential. Dirac’s formulation of quantum theory in his book, Principles of Quantum Mechanics [9], is 
unsurpassed. Dirac’s rules that the quantum states are represented by bra or ket vectors and physical 
observables by Hermitian operators are used in the text. There are two distinct quantum effects, the 
first of which concerns a single particle and the second a system of identical particles. 

1.2.3 Heisenberg Uncertainty Principle 

Let us consider a simple harmonic oscillator characterized by the Hamiltonian 

p 2 kx 2 

where m is the mass, k the force constant, p the momentum, and x the position. The corresponding 
energy eigenvalues are 

0 ^ 2 ) £ *+ 1 ) ■ “•-(£) - 

The energies are quantized in (1.2) . In contrast the classical energy can be any positive value. The 
lowest quantum energy s 0 = hco 0 /2, called the energy of zero-point motion, is not zero. The most stable 

state of any quantum system is not a state of static equilibrium in the configuration of lowest potential 
energy, it is rather a dynamic equilibrium for the zero-point motion [10, 11]. Dynamic equilibrium 
may be characterized by the minimum total (potential + kinetic) energy under the condition that each 
coordinate q has a range Aq and the corresponding momentum p has a range Ap, so that the product 
AqAp satisfies the Heisenberg uncertainty relation: 

(1.3) a * j a p > ■ 

The most remarkable example of a macroscopic body in dynamic equilibrium is liquid helium (He). 
This liquid with a boiling point at 4.2 K is known to remain liquid down to 0 K. The zero-point 
motion of He atoms precludes solidification. 


1.2.4 Bosons and Fermions 

Electrons are fermions. That is, they are indistinguishable quantum particles subject to the Pauli 
exclusion principle. Indistinguishability of the particles is defined by using the permutation symmetry. 
According to Pauli’s principle no two electrons can occupy the same state. Indistinguishable quantum 
particles not subject to the Pauli exclusion principle are called bosons. Bosons can occupy the same 
state with no restriction. Every elementary particle is either a boson or a fermion. This is known as 
the quantum statistical postulate. Whether an elementary particle is a boson or a fermion is related to 
the magnitude of its spin angular momentum in units of h. Particles with integer spins are bosons, 
while those with half-integer spins are fermions [12]. This is known as Pauli’s spin-statistics 
theorem According to this theorem and in agreement with all experimental evidence, electrons, 
protons, neutrons, and p-mesons, all of which have spin of magnitude b/2, are fermions, while 
photons (quanta of electromagnetic radiation) with spin of magnitude h, are bosons. 





1.2.5 Fermi and Bose Distribution Functions 


The average occupation number at state k, denoted by {n k ), for a system of free fermions in 

equilibrium at temperature T and chemical potential p is given by the Fermi distribution Junction: 

l 


(1 4) K) = 


for fermions, 


mhn)+ 1 

where e k is the single-particle energy associated with the state k. The average occupation number at 
state k for a system of free bosons in equilibrium is given by the Bose distribution junction: 

{ft*} = fa{£k) = —- —— for bosons. 


(1.5) 
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1.2.6 Composite Particles 

Atomic nuclei are composed of nucleons (protons, neutrons), while atoms are composed of nuclei and 
electrons. It has been experimentally demonstrated that these composite particles are 
indistinguishable quantum particles. According to Ehrenfest-Oppenheimer-Bethe’s rule [12, 13], the 
center of mass (CM) of a composite moves as a fermion (boson) if it contains an odd (even) number 
of elementary fermions. Thus, He 4 atoms (four nucleons, two electrons) move as bosons while He 3 
atoms (three nucleons, two electrons) move as fermions. Cooper pairs (two electrons) move as 
bosons. 


1.2.7 Quasifree Electron Model 

In a metal at the lowest temperatures conduction electrons move in a nearly stationary periodic 
lattice. Because of the Coulomb interaction among the electrons, the motion of the electrons is 
correlated. However, each electron in a crystal moves in an extremely weak self-consistent periodic 
field. Combining this result with the Pauli exclusion principle, which applies to electrons with no 
regard to the interaction, we obtain the quasifree electron model. The quasifree electron moves with 
the effective mass m* which is different from the gravitational mass nig. In this model the quantum 
states for the electron in a crystal are characterized by wave vector (k vector: k) and energy 
(1.6) £ = 

At 0 K, all of the lowest energy states are filled with electrons, and there exists a sharp Fermi surface 
represented by 

(1.7) m = cj, 

where e F is the Fermi energy. Experimentally, the electrons in alkali metals, which form body- 

centered cubic (bee) lattices, including lithium (Li), sodium (Na), and potassium (K), behave like 
quasifree electrons. 

1.2.8 “Electrons” and “Holes” 

“Electrons” (“holes”) in the text are defined as quasiparticles possessing charge e (magnitude) that 
circulate counterclockwise (clockwise) when viewed from the tip of the applied magnetic field 




vector B. This definition is used routinely in semiconductor physics. We use the quotation-marked 
“electron” to distinguish it from the generic electron having the gravitational mass nig. A “hole” can 

be regarded as a particle having positive charge, positive mass, and positive energy. The “hole” does 
not, however, have the same effective mass m* (magnitude) as the “electron,” so that “holes” are not 
true antiparticles like positrons. We will see that “electrons” and “holes” are thermally excited 
particles and they are closely related to the curvature of the Fermi surface (see Chapter 3). 

1.2.9 The Gate Field Effect 

Graphene and nanotubes are often subjected to the so-called gate voltage in experiments. We will 
show here that the gate voltage polarizes the conductor and hence the surface charges (“electrons,” 
“holes”) are induced. The actual conductor may have a shape and a particular Fermi surface. But in 
all cases surface charges are induced by electric fields. If a bias voltage is applied, then some 
charges can move and generate currents. 

A. 

Let us take a rectangular metallic plate and place it under an external electric field E, see Figure 1.4 . 
Figure 1.4 The surface charges are induced in the conductor under an external electric field E. 
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When the upper and lower sides are parallel to the field E, then the remaining two side surfaces are 
polarized so as to reduce the total electric field energy. If the plate is rotated, then all side surfaces 
are polarized. 

B. 

Let us now look at the electric field effect in k-space. Assume a quasifree electron system which has 
a spherical Fermi surface at zero field. Upon the application of a static field E, the Fermi surface will 
be shifted towards the right by qEi/m*, where t is the mean free time and m* the effective mass, as 
shown in Figure 1.5 . There is a steady current since the sphere is off from the center O. We may 
assume that the ionic lattice is stationary. Then, there is an unbalanced charge distribution as shown, 
where we assumed q = -e < 0. This effect will appear only on the surface of the metal. We used the 
fermionic nature of electrons in B. 



Figure 1.5 The Fermi surface is shifted by eEx/m* due to the electric field E. 
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1.3 Book Layout 

In Chapters 2 and 3 kinetic theory and Bloch electron dynamics are developed, respectively Phonon 
and electron-phonon interaction are discussed in Chapter 4. These chapters are preliminaries for the 
theory of the conductivity of carbon nanotubes, which is discussed in Chapters 5 and 6. 
Semiconducting SWNTs are discussed in Chapter 6. A quantum statistical theory of superconductivity 
is summarized in Chapter 7. Chapter 8 deals with the supercurrents in metallic SWNTs, starting with 
the BCS-like Hamiltonian and deriving expressions for a linear dispersion relation, and a critical 
(superconducting) temperature. Metallic SWNTs exhibit non-Ohmic behavior, and charged particles 
appear to run through the tube length with no scattering. We interpret this in terms of the condensed 
Cooper pairs (pairons). 

An applied static magnetic field induces a profound change in the electron states. Pauli’s 
paramagnetic and Landau’s diamagnetism are described in Chapter 9. Landau states generate an 
oscillatory density of states that induces de Haas-van Alphen oscillation which is discussed in 
Chapter 10. The Quantum Hall Effect (QHE) in GaAs/AlGaAs is summarized in Chapter 11. The 
QHE in graphene observed at room temperature is discussed in Chapter 12. The QHE occurs where 
the “hole” (“electron”) density becomes high near the neck Fermi surface, which develops by 
charging the graphene through the gate voltage. The different temperatures generate different carrier 
densities and the resulting carrier diffusion generates a thermal electromotive force. A new formula 
for the Seebeck coefficient is obtained and is applied to multiwalled carbon nanotubes in Chapter 13. 
In Chapter 14, we, discuss miscellaneous topics. 

1.4 Suggestions for Readers 

Graphene and CNTs are composed entirely of carbons but their lattice structures are distinct from 
each other. The simple free electron model does not work. To describe the electrical conduction of 
graphene and CNTs it is necessary to understand a number of advanced topics including 
superconductivity and Fermi surfaces. 

1.4.1 Second Quantization 

Reading Chapter 7 Superconductivity requires a knowledge of second quantization. The authors 
suggest that the readers learn the second quantization in two steps. 

1. Dirac solved the energy-eigenvalue problem for a simple harmonic oscillator in the 
Heisenberg picture, using creation and annihilation operators (a^, a), see Chapter 4, Section 4.3. 
We follow Dirac [9] and obtain the eigenvalues, (n' + l/2)hco, where n' is the eigenvalues of n = 
at a ,n' = 0, 1,2, ... 

2. Read Appendix A. 1, where a general theory for a quantum many-boson and fermion system is 
presented. 



1.4.2 Semiclassical Theory of Electron Dynamics 

Electrons and phonons are regarded as waves packets in solids. Dirac showed that the wave packets 
move, following classical equations of motion [9]. The conduction electron (“electron,” “hole”) size 
is equal to the orthogonal unit cell size. The phonon size is about two orders of magnitude greater at 
room temperature. The “electron” and “hole” move with effective masses m* which are distinct from 
the gravitational effective mass nig. Bloch electron dynamics are described in Chapter 3. 


1.4.3 Fermi Surface 

The time-honored WS cell model can be used for cubic lattice systems including a diamond lattice. 
For hexagonal systems including graphene and graphite an orthogonal unit cell model must be used to 
establish the k-space. Read Sections 5.2 and 5.4. The same orthogonal unit cell model must be used 
for the discussion of phonons. 

In our quantum statistical theory we do not jump to conclusions. We make arguments backed up by 
step-by-step calculations. This is the surest way of doing and learning physics for ordinary men and 
women. 
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1} Graphene is the basic structural element of some carbon allotropes including graphite, CNTs, 
and fullerenes. The name comes from graphite + -ene; graphite itself is composed of many 
graphene sheets stacked together. Graphene as a name indicates a single, two-dimensional layer of 
three-dimensional graphite which contains many layers of carbon hexagons. Two-dimensional 
graphene can exist in nature as the spacing between layers (3.35 A) is longer than the distance to 
neighboring atoms a c _ c (1-42 A) within the same plane. It has been challenging to isolate one layer 

from bulk graphite. 


Chapter 2 


Kinetic Theory and the Boltzmann Equation 


Elements of the kinetic theory of gas dynamics, the Boltzmann equation method, and electrical 
conduction are discussed in this chapter. 


2.1 Diffusion and Thermal Conduction 

In order to clearly understand diffusion let us look at the following simple situation. Imagine that four 
particles are in space a, and two particles are in space b as shown in Figure 2.1 . 

Figure 2.1 If the particles flow out in all directions with no preference, there will be more particles 

crossing the imaginary boundary CC’ in the a to b direction than in the opposite direction. 
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Assuming that both spaces a and b have the same volume, we may say that the particle density is 
higher in a than in b. We assume that half of the particles in each space will be heading toward the 
boundary CC’. It is then natural to expect that in due time two particles would cross the boundary CC’ 
from a to b, and one particle from b to a. This means that more particles would pass the boundary 
from a to b, that is, from the side of high density to that of low density. This is the cause of diffusion. 
The essential points in the above arguments are the reasonable assumptions that 

(i) the particles flow out from a given space in all directions with the same probability, and 

(ii) the rate of this outflow is proportional to the number of particles contained in that space. 

In the present case condition (i) will be assured by the fact that each particle collides with other 
particles frequently so that it may forget how it originally entered the space, and may leave with no 
preferred direction. From a more quantitative perspective it is found that the particle current j is 
proportional to the density gradient V n with n being the number density: 

( 2 . 1 ) j=-DV»> 

where D is called the diffusion coefficient. This linear relation (2.11 is called Fick’s law [1,2]. 

Consider next thermal conduction. Assume that the spaces a and b are occupied by the same number 
of the particles. Further assume that the temperature T is higher in b than in a. Then, the particle speed 











is higher in b than in a on the average. In due time a particle crosses the boundary CC’ from a to b and 
another crosses the boundary CC’ fromb to a. Then, the energy is transferred through the boundary In 
a more detailed study Fourier’s law [1,2] is observed: 

(2.2) g = -kvt , 

where q is the heat (energy) current and the proportionality constant K is called the thermal 
conductivity. 


2.2 Collision Rate: Mean Free Path 


Let us consider a particle moving through a medium containing n molecules per unit volume. If the 
particle proceeds with a speed v, it will sweep the volume (vdt) x A = v Adt during the time interval 
dt, where A represents the cross section. See Figure 2.2 . 


Figure 2.2 A particle moving with a speed v sweeps the volume (vdt)A during the time dt, where A 
represents the cross section. 
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The particle would collide with any molecule if the latter lies within the cylinder. Now, the number 
of molecules in the cylindrical volume vdt is n(v Adt) = nv Adt. Dividing this number by dt, we 
obtain the number of collisions per unit time: 


t2.3I <»c = nvA. 

This frequency co c is called the collision rate. Note that the collision rate depends linearly on the 
speed v, the number density n, and the cross section A. 

The above consideration may be applied to the molecular collision in a gas. In this case, the 
particle in question is simply one of the molecules. Let us estimate the collision rate for a typical gas. 
For neon gas, the interaction potential between two atoms has a range of a few angstroms, 

(2.4) ft (range} fy 2 A = 2 x 10 —3 cm . 

Therefore, the total cross section A has the following order of magnitude: 

(2.5) A = Jtft 2 jy 3.14x (2x IQ 8 ) 2 cm 2 = 1.2 x 10~ 1S cm 2 . 

Atypical atom has a kinetic energy of the order 3/2k B T. Therefore, it has the thermal speed 
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Using the data T = 273 K and m (neon) = 20 rr^ (proton mass), we then obtain 


(2.7) V *.— -I = 5.9 X 10 4 cms —1 = 590m s _1 . 

It is interesting to note that this molecular speed has the same order of magnitude as the speed of 
sound 340 ms -1 . At 0° C and 1 atmospheric pressure, the number of molecules per cm 3 is 

















(2.8) n = 2.69 x 10 1 * cm 3 . 

If we substitute the values from (2.5W2.7) into 12.3) . we obtain 
t2.91 <o c = 1.91 x 10* s _1 . 

Here we see that atoms collide with each other an enormous number of times in a second. 

Strictly speaking, we should choose the average speed of approach between atoms rather than the 
thermal speed when estimating the value of v when both molecules move. Such a choice, however, 
will not change the order of magnitude of oo c . In a more quantitative treatment, this choice becomes 

important. 

The inverse of the collision rate defined by 

_ _i_ l 
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is called the collision time, or the average time between collisions. Substituting the numerical value 
from 12.91 . we get 

mn T - - i.9i " 522 x 10 '” >s • 

Let us now compare this time x c with the average collision duration, the average time that the 
molecule spends within the force range R of another particle. The latter is defined by 

K 

(2.12) Td= inf¬ 
using the numerical values from (2.5) and (2.6) . we have 
12.131 Td = 3.4 x to -13 s. 

Comparison between 12.111 and 12.131 shows that 

(2.14) T c »T d . 

This means that in a typical gas the molecules move freely most of the time, and occasionally collide 
with each other. 

By multiplying the thermal speed v thermal by the average time between collision x c , we obtain 

12. 151 ^iliermai xt c = ! (mean free path). 

This quantity, called the mean free path, gives a measure of the distance that a typical molecule 
covers between successive collisions. From 12.101 and 12.151 . we obtain 

(2. 1 6) ^ — ^thermal ^ = Wher mal X ^ ftAVrhcr mfll ) — (ftA) ■ 

Note that the mean free path does not depend on the speed of the particle, and therefore has a value 
independent of temperature. Introducing the numerical values from 12.71 and 12.111 . we obtain 
(2.17) t w 3.1 x nr 5 cm = iiooA » j?(~2A). 

We see that the mean free path is about three orders of magnitude greater than the force range. 
Problem 2.2.1. Using the numerical values introduced in the present section, estimate 

1. The probability of a particular particle being within the force range of another particle, and 

2. The probability of a particular particle being within the force range of two particles 
simultaneously. 

Problem 2.2.2. Assume that the probability of collision for a small distance Ax is given by Ax/1. 
Show that the probability of the particle proceeding without collision for a finite distance x is given 






















by exp(-x/l). Hint: lim lwrx (1 - x/n) n = e 


2.3 Electrical Conductivity and Matthiessen’s 
Rule 


Let us consider a system of electrons moving independently in a potential field of impurities, which 
act as scatterers. The inpurities are assumed to be distributed uniformly. 

Under the action of an electric field E that points along the positive x-axis, a classical electron with 
mass m will move following Newton’s equation of motion: 
dv* 

(2.18) m ir --e£ ' 

in the absence of an impurity. Solving this, we obtain 

(2.19) + 

where v x ° is the x-component of the initial velocity. For a free electron the velocity v x can increase 
indefinitely and leads to infinite conductivity. 

In the presence of inpurities, the uniform acceleration will be interrupted by scattering. When the 
electron hits a scatterer (impurity), the velocity will suffer an abrupt change in direction and grow 
again following t2.19I until the electron hits another scatterer. Let us denote the average time between 
successive scatterings or mean free time by x f . The average velocity (v x ) is then given by 

12.201 {Vx) = _ ^ £Tf ’ 

where we assumed that the electron forgets its preceding motion every time it hits a scatterer, and the 
average initial velocity after collision is zero: 

(2.21) W = °- 

The charge current density (average current per unit volume) j is given by 

j = (charge : —e) x (Dumber density : n) x (velocity : (v*)) 

(2.22) —- %r s • 

where we used f2.20L According to Ohm’s law, the current density j is proportional to the applied 
electric field E when this field is small: 


(2.23) j-oS. 

The proportionality factor a is called the electrical conductivity [1, 2], It represents the facility 
with which the current flows in response to the electric field. Combining the last two equations, we 
obtain 



This equation is used in the qualitative discussion of the electrical transport phenomenon. The 
inverse mass-dependence law means that the ion contribution to electrical transport in an ionized gas 
will be smaller by at least three orders of magnitude than the electron contribution. Also note that the 








conductivity is higher if the number density is greater and if the mean tree time is greater. 

The inverse of the mean tree time x f , 

r= L 

(2.25) T f 

is called the scattering rate or the relaxation rate. Roughly speaking this T represents the mean 
frequency with which the electron is scattered by inpurities (scatterers). The scattering rate T is 
given by 

(2.26) r = n : vA, 

where n I? v, and A are respectively the density of scatterers, the electron speed, and the scattering 
cross section. 

If there is more than one kind of scatterer, the scattering rate may be computed by the addition law: 

(2.27) r = -i-njvjAj + ... = Fi-\-r 2 -\-... 

This is called Matthiessen’s rule. 

Historically and still today, the analysis of resistance data for a conductor is performed as follows: 
if the electrons are scattered by inpurities and again by phonons (quanta of lattice vibrations), the 
total resistance will be written as the sum of the resistances due to each separate cause of scattering: 


( 2 . 28 ) — iiimpuiily “I" ^phonon ■ 

This is the original statement of Matthiessen’s rule. In further detail, the electron-phonon scattering 
depends on temperature because of the changing phonon population while the effect of the electron- 
impurity scattering is temperature-independent. By separating the resistance into two parts, one 
temperature-dependent and the other temperature-independent, we may apply Matthiessen’s rule. 
Problem 2.3.1. Free electrons are confined within a long rectangular planer strip. Assume that each 
electron is diffusely scattered at the boundary so that it may move in all directions without preference 
after the scattering. Find the mean free path along the length of the strip. Calculate the conductivity. 
Problem 2.3.2. Assume the same condition as in Problem 2.3.1 for the case in which electrons are 
confined within a long circular cylinder. Find the conductivity. 


2.4 The Hall Effect: “Electrons” and “Holes” 

In this section we discuss the Hall effect, which was discovered in 1879 by E.H. Hall and published 
in [3]. The “electron” (“hole”) has a negative (positive) charge —e (+e), e = 4.80 x 10 _10 esu = 1.6 x 
10 _19 C. As we see later, “electrons” and “holes” play very important roles in the microscopic theory 
of electrical transport and superconductivity. Let us consider a conducting wire connected to a 
battery. If a magnetic field B is applied, the field penetrates the wire. The Lorentz force, 

(2.29) F = gvx b , 

where q is the charge of the carrier, may then affect the electron’s classical motion. If so, the picture 
of the straight line motion of a free electron in kinetic theory has to be modified significantly. If the 
field B (magnitude) is not too high and the stationary state is considered, the actual physical situation 
turns out to be much simpler. 

Take the case in which the field B is applied perpendicular to a wire of rectangular cross section as 


shown in Figure 2.3 . 

Figure 2.3 Schematic view of Hall’s experiment. The magnetic and electric forces (F B , F E ) are 
balanced to zero in the Hall effect measurement. 
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Experiments show that a voltage V H is generated perpendicular to the field B and electric current j 

such that a steady current flows in the wire apparently unhindered. This is the essence of the Hall 
effect. We may interpret this condition as follows. Let us write the current density j as 

f2.3(F) j = qnv A , 

where n is the density of conduction electrons and v d the drift velocity. A charge carrier having a 
velocity equal to the drift velocity v d is affected by the Lorentz force: 

(2.31) F-*(*h+imxJ1> p 

where E H is the electric field due to the Hall voltage V H . In the geometry shown in Figure 2.3 . only 
the x-component of the force F is relevant. If the net force vanishes: 

(2.32) g{EH-tvaB) = Q, 

then the carrier can proceed along the wire (z-direction) unhindered. 

Let us check our model calculation. We define the Hall coefficient R H by 

_ e h 

(2.33) Rh= Tb’ 

where the three qualities (E H , j, B) on the right-hand side can be measured. Using f2.30t and (232), 
we obtain 


(2.34) 


= - 

qn 


The Hall coefficient is a material parameter, because we will get different numbers for R H if we do 

experiments with identical magnetic fields and current densities, but with different materials. The 
experimental values for -l/(qn R H ) in some metals are given in Table 2.1 . 

Table 2.1 Hall coefficients of selected metals. 


Metal Valence -l/(nq R H ) 


Li 1 
Na 1 
K 1 


- 0.8 

- 1.2 

- 1.1 

-1.5 

-1.3 


Cu 1 
Ag 1 





















Au 1 -1.5 

Be 2 0.2 

Mg 2 0.4 

In 3 0.3 

A1 3 0.3 

For alkali metals the agreement between theory and experiment is nearly perfect. The measured 
Hall coefficient R H is negative for most metals. This can be understood by assuming that the charge 

carriers are “electrons” having a negative charge q = -e. However, there are exceptions. As we see 
in Table 2.1 . Al, Be and others exhibit positive Hall coefficients. This can be explained only by 
assuming that in these metals the main charge carriers are “holes” possessing a positive charge q = 
+e. This is a quantum many-body effect. As we shall see later, the existence of “electrons” and 
“holes” is closely connected to the curvature of the Fermi surface. Nonmagnetic metals which have 
“holes” tend to be superconductors, as will be explained later. 


2.5 The Boltzmann Equation 


In the method of a Boltzmann equation the qualitative arguments in the simple kinetic theory will be 
formulated in more precise terms. In some simple cases, the description of the transport phenomena 
by this method is exact. In more complicated cases this method is an approximation, but it is widely 
used. 

Let us consider the electron-inpurity system, a system of free electrons with uniformly distributed 
inpurities. We introduce a momentum distribution Junction cp(p, t) defined such that (p(p, t)d 3 p gives 
the relative probability of finding an electron in the element d 3 p at time t. This function will be 
normalized such that 

( 2 . 35 )( 2 ^/ dV ''’’"-?-"- 

The electric current density j is given in terms of q> as follows: 

The function (p can be obtained by solving the Boltzmann equation, 
following manner. 

The change in the distribution function (p will be caused by the force 
element d 3 p and by collision. We may write this change in the form: 


which may be set up in the 
acting on the electrons in the 


( 237 )^ 


d if 

dT 


force 


+ d7 


collision 


The force term d(p/dt| force , caused by the force - eE acting on the electrons can be expressed by 


=_ 6 E 

(2.38) * w dp ' 

If the density of inpurities, n I? is low and the integration between electron and inpurity has a short 

range, the electron will be scattered by one inpurity at a time. We may then write the collision term in 
the following form: 










dip 

(239} 


collision 


J m 


where dQ is the solid angled in a spherical coordinate system and I(|p|, 0) is the differential cross 
section. In fact, the rate of collision is given by (density of scatterers) x (speed) x (total cross section). 
If we apply this rule to the flux of particles with momentum p, we can obtain the second integral of 
12.39) . the integral with the minus sign. The integral corresponds to the loss of the flux due to the 
collision (see Figure 2.4a ). The flux of particles with momentum p can gain by inverse collision, 
which is shown in Figure 2.4b . The contribution of the inverse collision is represented by the first 
integral. 


Figure 2.4 In (a), the electron suffers a change in momentum from p to p' after scattering. The inverse 
collision is shown in (b). 




So far, we have neglected the fact that electrons are fermions and are therefore subject to the Pauli 
exclusion principle. We will now look at the effect of quantum statistics. 

If the final momentum state p' was already occupied, then the scattering from state p to state p' 
should not have occurred. The probability of this scattering therefore should be reduced by the factor 
1 - cp(p', t), which represents the probability that the final state p' is unoccupied. Consideration of the 
exclusion principle thus modifies the Boltzmann collision term given in (239} to 


dtp 

di 


coUision 




P2.40) 

When P2.40) is expanded, the two terms proportional to cp(p', t)(p(p, t) in the curly brackets cancel 
each other out. We then have the same collision term as given by (239} . 

Gathering the results from (231) through (239}, we obtain 


dy(iM) 

dt 


4- eE 


d<P(P.t) 


ni f d£2— 
J m 


(2AX} dt ' - Bp 

This is the Boltzmann equation for the electron-impurity system This equation is linear in (p, and 
much simpler than the Boltzmann equation for a dilute gas. In particular, we can solve (2A1} by 
elementary methods and calculate the conductivity a. We will do this in the next section. For simple 
forms of the scattering cross section, we can also solve 12.41) as an initial value problem (Problems 
2.5.1 and 2.5.2). 























As mentioned at the beginning of this section, the Boltzmann equation is very important, but it is an 
approximate equation. If the inpurity density is high or if the range of the interaction is not short, then 
we must consider simultaneous scatterings by two or more inpurities. If we include the effect of the 
Coulomb interaction among electrons, which has hitherto been ignored, the collision term should be 
further modified. It is, however, very difficult to estimate appropriate corrections arising from these 
various effects. 

Problem 2.5.1. Obtain the Boltzmann equation for an electron-impurity system in two dimensions, 
which maybe deduced by inspection, from (2.411 . Assume that all electrons have the same velocity at 
the initial time t = 0. Further, assuming no electric field (E = 0) and isotropic scattering (I = constant), 
solve the Boltzmann equation. 

Problem 2.5.2. Solve the Boltzmann equation 12.411 for three dimensions with the same condition as 
in Problem 2.5.1. Define the Boltzmann H-function by H(t) = (1/n) J d 3 pcp(p, t) ln(cp(p, t)/n). Evaluate 
dH/dt. Plot H(t) as a function of time. 


2.6 The Current Relaxation Rate 

Let us assume that a small constant electric field E is applied to the electron-inpurity system and that 
a stationary homogeneous current is established. We take the positive x-axis along the field E. In the 
stationary state, <3(p/<9t = 0: the distribution function (p depends on momentum p only. From (2.411 the 
Boltzmann equation for cp is then given by 

- e£ ^r “ m f ia ^ ■ 

We wish to solve this equation and calculate the conductivity o. 

In the absence of the field E, the system, by assumption, is characterized by the equilibrium 
distribution function, that is, the Fermi distribution function for free electrons: 

r2 4 3|W(P)-/(^)- e j„A + 1 - 

With the small field E, the function cp deviates from (p 0 . Let us regard <p as a function of E and expand 
it in powers of E: 

<?{p) = + <(h + ■ ■■ 

(2.441 = f( £ p) + + ■ ■ ■ - 

where the subscripts denote the order in E. For the determination of the conductivity a we need qq 
only. Let us introduce (2.441 in (2.421 . and compare terms of the same order in E. 

In the zeroth order we have 

(2.45) m / dQ |p= 0 ■ 

Since the energy is conserved in each scattering, 

(2.46) £ p' = £ p > 

<P 0 (P) = f(£p) clearly satisfies (2.451 . In the first order in E, we obtain 














( 2 . 47 ) 


tyx <k P 


m df. 


Therefore, we obtain from I2.42I 
<7.ak = ■ 

For the moment, let us neglect the first term on the rhs. In this case, the rhs equals —(n x /m)|p|cpi(|p|) 1 
d to I. Then, the function cpjdpl) is proportional to p x . Let us now try a solution of the form: 


12.491 <iM|p|) = - 

where 0(e p ) is a function of s p (no angular dependence). Substitution of 12.491 into 12.481 yields 

^2 ~ eEp *J^ = ni J ■ 

Let us look at the integral on the rhs. We introduce a new frame of reference with the polar axis (the z- 
axis) pointing along the vector p as shown in Figure 2.5 . The old positive x-axis, which is parallel to 
the electric field E, can be specified by the angle (0, (p). From the diagram we have 


Figure 2.5 A new frame of reference in which the positive z-axis points in the direction of the fixed 
vector p. In this frame, the direction of the electric field E is specified by (0, c|)) and that of the 
momentum p' by (%, 4>i). 



(2.51) p* = |p|«>s0 = pcose . 

The vector p' can be represented by (p, %, (jq). If we denote the angle between p' and E by c|>, we 
have 

(2.52) p* =p. 

To express cos (j) in terms of the angles (0, (j)) and (x, (j>i), we use the vector decomposition property 
and obtain 

/ p 

cos v-' --— 

p E 

— (f sin^ cos#! -I- j sin % sin 4- fccosx) 

■ (i sinf) cos# 4- j sin 8 sin # -I- k cos 8) 

= sin % sin 8 [cos #i cos <j> 4 sin sin #] -I- cos % cos 8 


(2.53) cos # = sin# sin 6 cos(# — #i) -I- cos* cos 8 . 

Let us now consider the first integral in f2.50f : 

















A=m JdQpI(p,x)<P{£ P ){pcasy?) 

lit n 

= K X / d0 ‘/ sinxp 2 1{p, %)&(£ p }[sitix sin 6 cos(0 - fa) 


(2.54) 

Since 


4 0 

-1- cos % cos Q] 


2 it 


/ d 0 t cos(<p — ^i) = 0 , 

(2.55) 8 

the first integral can be dropped. We then obtain 


2 x 


A= pti J &pi J dx sin xpl{p,x)^sx[p cos e*P{£ p )} 

0 4 

(2 56) = n ^ ni f d£2p Z <P- x ) cos % ■ 


The second integral in (2.50) . therefore, is proportional to cp(p). We thus obtain the solution: 

. i_ u Px df 1 

( 2 . 57 ) f ' {P) 6 » r{ P ) ’ 
where T(p) is given by 

^2 ^ r (p) = j do .P J (tMr)l 1 - CDS Zl >o. 

The T here is positive and depends only on p = |p| (or equivalently on the energy Sp); it has the 

dimension of frequency and is called the energy-dependent current relaxation rate or simply the 
relaxation rate. Its inverse is called the relaxation time. 

The electric current density j x can be calculated from 

— 2 e 


(2.59) J * 


j tfppMp) 


We introduce 9 = cp 0 + 9i + ••• i n this expression. The first term gives a vanishing contribution (no 
current in equilibrium). The second term yields, using 12.551 . 


(2.60) 


j* = 


2 e 2 E 


(23iftp m 




pi 


d/ 




Comparing this with Ohm’s law, j x = a E, we obtain the following expression for the conductivity: 


2 e 2 


a = 


[#, d (_M. \ 4 t [a>„ 1 M. 

J P r{p)\ is.) 1 (2ji*)> ) P Tip) is, ’ 


12.611 ~ ( 2 xfipm 2 

where we used s = (p x 2 + p y 2 + p z 2 )/(2m). A few applications of this formula will be discussed later. 

Problem 2.6.1. For a classical hard-sphere interaction, the scattering cross section I is given by 
l/2a 2 , where a is the radius of the sphere. Evaluate the relaxation rate T(p) given by 12.581 . Using this 
result, calculate the conductivity a through 12.611 . Verify that the conductivity calculated is 
temperature-independent. 

Problem 2.6.2. Formula 12.611 was obtained with the assumption that the equilibrium distribution 
function in the absence of the field is given by the Fermi distribution function 12.431 . 

1. Verify that the same formula applies when we assume that the equilibrium distribution function 
is given by the Boltzmann distribution function. 


















2. Show that the conductivity calculated by this formula does not depend on the temperature. 
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Chapter 3 


Bloch Electron Dynamics 


To properly develop a microscopic theory of the conduction a deeper understanding of the properties 
of normal metals than what is provided by the free-electron model is required. On the basis of the 
Bloch theorem, the Fermi liquid model is derived. At 0 K, the normal metal is shown to have a sharp 
Fermi surface, which is experimentally supported by the fact that the heat capacity is linear in 
temperature at the lowest temperatures. Electrons and holes, which appear in the Hall effect 
measurements, are defined in terms of the curvature of the Fermi surface. Newtonian equations of 
motion for a Bloch electron (wavepacket) are derived and discussed. 


3.1 Bloch Theorem in One Dimension 

Let us consider a periodic potential y(x) in one dimension, see Figure 3.1 . that satisfies 

Figure 3.1 Aperiodic potential in one dimension. The blackened circles (•) indicate lattice ions. 

a 

—O-•-» 9 * X 



(3.1) -I- na) = V(je), — oo < x < oo 

where a is the lattice constant and n is an integer. 

The Schrodinger energy-eigenvalue equation for an electron is 

Clearly the wavefunction \|/ E (x + na) also satisfies the same equation. Therefore, \|/ E (x + na) is 
likely to be different from \|/ E (x) only by an x-independent phase: 

I3.3I + na) = , 

where k is real, see below. Equation 13.31 represents a form of the Bloch theorem [1]. It generates 
far-reaching consequences in the theory of conduction electrons. Let us prove (33). Since \|/(x) and 
v)/(x + na) satisfy the same equation, they are linearly dependent: 

13.41 + na) - c{na)y{x ). 

Using I3.4I twice, we obtain 


rp{x-\-na-\-ma) = c{na)ty{x-\-ma) = c{na)c{ma)^{x) = c{na-\-ma)\p{x ). 

Since the wavefunction \|/(x) does not vanish in general, we obtain 













(3.5) c i na 4- ma) = c(na)c(ma) or c(x 4- y} = c{x)c{y). 

Solving this functional equation, we obtain (Problem 3.1.1) 

13.61 c(y) = exp^y) > 

where A, is a constant. Because the wavefunction \\t in (3.41 must be finite for all ranges, constant A 
must be a pure imaginary number: 


t3.7U = ifc> 

where k is real. Combining t3.4L t3.6L and (3.7) we obtain (33). 

Let us discuss a few physical properties of the Bloch wavefunction v|/. By taking the absolute square 
of (33), we obtain 


(3.8) 


| V >(* + n «)\ 2 = 


The following three main properties are observed: 

(a) The probability distribution function P(x) = |\[/(x)| 2 is lattice periodic: 


(3.9) = p ( x 4- na), for any ft. 

(b) The exponential function of a complex number exp(iy) (y real) is periodic: exp(i(y + 2n m)) = 
exp(iy), where m is an integer. We may choose the real number k in (33), called the k-number (2 n 
times the wavenumber), to have a fundamental range: 

ji Jt 

(3.10) “7- ~ 7 ; 

the two end points (~n/a, n/a ) are called the Brillouin boundary (points). This fundamental 
range for k is called the first Brillouin zone. 

(c) In general, there are a number of energy gaps (forbidden regions of energy) in which no 
solutions of 13.21 exist (see Figure 3.4 and Section 3.2). The energy eigenvalues E are 
characterized by the k-number and the zone number (or band index) j, which enumerates the 
energy bands: 


(3.11)* = W- 

This property (c) is not obvious, and it will be illustrated by examples in Section 3.2. 
To further explore the nature of the Bloch wavefunction \|/, let us write 
13.121 tM*} = WjyfcM ) - 

and substitute it into (33). If the function Uj^(x) is lattice periodic, 


(3.13) + na) = Uj^x) t 

then t3.3f is satisfied (Problem 3.1.2). Equation 13.121 represents a second form of the Bloch 
theorem The Bloch wavefunction \|/(x) = Uj^(x) exp(ikx) has great similarity with the free-particle 

wavefunction: 


(3.14) V J fr e d*) = cexp(ifex), 

where c is a constant. The connection may be illustrated as shown in Figure 3.2 . For the free particle, 
the k-number can range from -oo to go, and the energy is 


Figure 3.2 Variation of the real (or imaginary) part of the wavefunction v|/ E (x). 
























(3.15) ree 2 m 


2 m 


with no gaps. This feature is different from the properties (b) and (c) in the list above. 

An important similarity arises when we write the time-dependent wavefimction \|/(x, t) in the 
running wave form: 

13.161 = t/{x}eKp{i(bf— on)), 

where the frequency co is defined by 

I A -1 E: (t) for the Bloch electron, 
fi 1 Efrez for the free electron , 

and the amplitude U(x) is defined by 

Uj k(x) for the Bloch electron, 


(3.18) 


t/(x) = 


for the free electron , 


Equation 13.161 shows that the Bloch wavefimction v)/ E (x) represents a running wave characterized by 
k-number k, angular frequency cd, and wave train u- ] k . 

The group velocity v of the Bloch wavepacket is given by 
Scy h dE 

(3 i 19) l '"0t a*' 

By applying the (quantum) principle of the wave-particle duality, we say that the Bloch electron 
moves with the dispersion (energy-momentum) relation: 

(3.20) E = = £ 0h- 

The velocity v is given by ( 3.19k This gives a picture of great familiarity. We generalize our theory to 
the three-dimensional case in Section 3.3. 

Problem 3.1.1. Solve the following functional equations: 

(i) f(x + y) = f(x) + f(y) 

(ii) f(x + y) = f(x) f(y) 

Hints: Differentiate (i) with respect to x, and convert the result into an ordinary differential 
equation. Take the logarithm of (ii), and use (i). Answer: (i) f(x) = ex, (ii) f(x) = e^ x , where c and X 
are constants. 

Problem 3.1.2. 

1. Show that Bloch’s wavefimction 13.121 satisfies 13.31 . 

2. Assuming (33), derive (332). 


3.2 The Kronig-Penney Model 

The Bloch energy-eigenvalues in general have bands and gaps. We show this by taking the Kronig- 
Penney (KP) model [8], Let us consider a periodic square-well potential y(x) with depth V 0 (< 0) and 

well width (3(= a - a) as shown in Figure 3.3 : 

Figure 3.3 A Kronig-Penney potential has a square-well with depth V 0 and width P periodically 
arranged with period a = a + p. 
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1 0 If na < x < na 4- a, 

Vb if na-/) <i'< na, 

where a = a + p and the n are integers. 

The Schrodinger energy-eigenvalue equation for an electron can be written as in 13.21 . Since this is 
a linear homogeneous differential equation with constant coefficients, the wavefimction \|/(x) should 
have the form 

(3.22) ${ x ) = c esp{yj;} > ( c j Y are constants). 

According to the Bloch theorem in 13.121 . this function \[/(x) can be written as 

(3.23) = WfcM eKp(ifex), 

(3.24) •+*(*) = i*k{x +a). 

The condition that the function \|/(x) be continuous and analytic at the well boundary yields the 
following relationships (Problem 3.2.1): 

/0 cos(tfl) = cosh^^costMfl + L-P-smh{K a )smfrp)= f{E) 

( i.Zj) z 


Z r 2 




E = 






na < x < na + a , 

na — ft < x < na p \^<0. 


13.261 

By solving 13.251 with 13.261 . we obtain the eigenvalue E as a function of k. The band edges are 
obtained from 

(3.27) /(£) =±i, 

which corresponds to the limits of cos(ka). Numerical studies of 13.251 indicate that (i) there are, in 
general, a number of negative- and positive-energy bands; (ii) at each band edge, an effective mass 
m* can be defined, whose value can be positive or negative and whose absolute value can be greater 
or less than the electron mass m; and (iii) the effective mass is positive at the lower edge of each 
band, and it is negative at the upper edge. A typical dispersion relation for the model, showing energy 
bands and energy gaps, is shown in Figure 3.4 . At the lowest band edge 8 0 we have 

(3.28) /Po) = l ■ 

Near this edge the dispersion (energy-k) relation calculated from 13.251 is (Problem 3.2.2) 

A 2 k 2 

(3.29) e = f °" 1 ’2m3 r (e<t<0) ' 

13.301 = -6 2 « _2 /"(£ 0 ). 


Figure 3.4 E-k diagram showing energy bands and forbidden gaps. 
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This one-dimensional KP model can be used to study a simple three-dimensional system Let us 
take an orthorhombic (ore) lattice of unit lengths (a 1? a 2 , a 3 ), with each lattice point representing a 

short-range attractive potential center (ion). The Schrodinger equation (3.2) for this system is hard to 
solve. 

Let us now construct a model potential y defined by 


(3.31) V{x,y,z) = V\(x) + V,(y) + V 3 <z) 


V s {u) = 



if H-Hj — 0 < W < H-Hj 

otherwise. 


(3.32) 

Here the n are integers. A similar two-dimensional model is shown in Figure 3.5 . The domains in 
which y ^ 0 are parallel plates of thickness p (< aj) separated by aj in the direction Xj, (x 1? x 2 , x 3 ) = (x, 

y, z). The intersection of any two plates are straight beams of cross section p 2 , where the potential y 
has the value 2V 0 . The intersections of three plates, where the potential y has the value 3 V 0 , are cubes 

of side length p. The set of these cubes form an ore lattice, a configuration similar to that of the 
commercially available molecular lattice model made up of balls and sticks. Note: Each square-well 
potential yj has three parameters (V 0 , p, aj), and this model represents the true potential fairly well 

[12]. The Schrodinger equation for the 3D model Hamiltonian 


Figure 3.5 A 2D model potential. Each singly shaded stripe has a potential energy (depth) V 0 . Each 
cross-shaded square has a potential energy 2V 0 . 
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(3.33) 

can now be reduced to three one-dimensional KP equations. We can then write an expression for the 
energy of our model system near the lowest band edge as 





























E = — k 2 -I- — k 2 
(3.34) 2 mi K * + 2m 2 K ’ 


2 HI; 


-I- -— It* -|- constant, 


a r 


where {rr^} are effective masses defined by (330) with a 

Equation (3.34) is identical to what is intuitively expected of the energy-k relation for the electron 
in the ore lattice. It is stressed that we derived it from first principles, assuming a three-dimensional 
model Hamiltonian Our study demonstrates qualitatively how electron energy bands and gaps are 
generated from the Schrodinger equation for an electron moving in a three-dimensional lattice space. 
Problem 3.2.1. Derive 13.25) . Hint: The wavefunction \\i and the energy eigenvalue E are given by 


^ = 


(Ae 1 ** -I- Be lKsf , E = —h 2 K 2 f{2m) if na < x < na + a 


Ce"* -I- De - "* , E = V b + ti 2 fi 2 /{2m) i£na-0 <x <na . 

Use \|/ and d\|//dx are continuous at the potential boundaries, say, x = 0 and x = a 

Problem3.2.2. Derive (3.30) . 


3.3 Bloch Theorem in Three Dimensions 


Let us take a monovalent metal such as sodium (Na). The Hamiltonian ^ of the system may be 
represented by 


* p 2 _L J 



a=l 


n 

2 M 


13.35) 


+EE 

a>y 


\R a — Ry 


EE 


t 0 e 2 


where kg = (4tis 0 ) l . The sums on the rhs represent, respectively, the kinetic energy of electrons, the 


interaction energy among electrons, the kinetic energy of ions, the interaction energy among ions, and 
the interaction energy between electrons and ions. The metal as a whole is electrically neutral, and 
hence the number of electrons equals the number of ions. Both numbers are denoted by N. 

At the lowest temperatures the ions are almost stationary near the equilibrium lattice points. 
Because of quantum zero-point motion, the ions are not at rest even at 0 K. But this fact does not 
affect the following argument. The system then can be viewed as the one in which the electrons move 
in a periodic lattice potential y. The Hamiltonian of this idealized system that depends on the electron 
variables only can be written as 


* P 2 i 


he? 






where y(r) represents the lattice potential, and the constant energy C depends on the lattice 
configuration. 

Let us drop the Coulomb interaction energy from (336). We then have 




«-Ej^+E v i r J> +c - 

(3.37) i 2m i 

For definiteness, we consider an infinite ore lattice. We choose a Cartesian frame of coordinates (x, 
y, z) along the lattice axes. The potential y(x, y, z) = y(r) is lattice periodic: 


(3.38) V<r} = V(r+fi} 














(3.39) * = + n 2 hj + njcjfc , (rtj: integers) 

where (a, b, c) are lattice constants and the vector R is called the Bravais lattice vector.^ The 
Schrodinger equation for an electron is 

n 4o> [~^ y2+V|r| ] ** (r) ~ • 

where s is the energy. Clearly \|/ k (r + R) also satisfies the same equation. The values of the 
wavefunction v|/ k at r and r + R may therefore be different only by a r-independent phase: 


041 ) 




where k = (k x , ky, E,) is called a k-vector. Equation f3.4If represents a form of Block’s theorem [1, 
2]. It generates far-reaching consequences in the theory of conduction electrons. 

The three principal properties of the Block wavefunctions are: 

1. The probability distribution P(r) is lattice periodic: 


(3.42) P(r) = hM r )| 2 =i>{r-Hi). 

2. The k- vector k = (k x , ky, k z ) in I3.41I has the fundamental range: 
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The end points that form a rectangular box, are called the Brillouin boundary. 

3. The energy eigenvalues s have energy gaps, and the allowed energies s can be characterized by 
the zone number j and the k-vector k: 


f3.441 e = = £ j(p) ■ 

Using f3.41h we can express the Bloch wavefunction v|/ in the form: 


a 45 ) 


tft(r) = fiM r ) = exp(ife ■ r) 


where u(r) is a periodic function: 


Uj,*(r +R) = Uj^r) 


046) 

Formula I3.45I along with I3.46I is known as the Bloch theorem [1,2]. The wavefunction i|/ k (r) in the 

form 13.451 resembles a plane wave describing the motion of a free particle, but here the wave is 
modulated by a periodic function u k (r). The k-vector k is connected with the momentum p by p = hk. 

Equations f3.44W3.46I indicate that the Bloch wavefunction v|/ k (r), associated with quantum 
numbers (j, k), is a plane wave characterized by k- vector k, angular frequency oj = Sj (hk)/h and wave 
train Uj k (r). It is clear from 13.45) that 


(3.47) = Kddl 2 = |Vj*(r + «)| 2 . 

Hence, the probability is lattice periodic. The Bloch theorem holds in 3D or 2D only if the Bravais 
vector is expressed in the Cartesian frame of reference. This is relevant to graphene physics, and will 
be further discussed in Chapter 5, Section 5.2. 

Problem 3.3.1. 

1. Show that the Bloch wavefunction (3 A D satisfies 13.401 . 

2. Assuming f3.41k derive Equations f3.451 and f3.46T 

























3.4 Fermi Liquid Model 

We consider a monovalent metal, whose Hamiltonian is given by (335): 
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(3.481 ■>1' l Ba_R di j „ l r J — 

The motion of the set of N electrons is correlated because of the interelectronic interaction. If we 
omit the ionic kinetic energy, and the interionic and interelectronic Coulomb interaction from (3.48) . 
we obtain 

pi 

*=-Ej£ + E v < r J> +c ' 

(3.49) j 2m i 

which characterizes a system of electrons moving in the bare lattice potential. 

Since the metal as a whole is neutral, the Coulomb interaction among the electrons, among the ions, 
and between electrons and ions, all have the same order of magnitude, and hence they are equally 
i mportant. We now pick out one electron in the system This electron is interacting with the system of 
N ions and N - 1 electrons, the system (medium) having the net charge +e. These other N - 1 
electrons should, in accordance with Bloch’s theorem, be distributed with the lattice periodicity and 
all over the crystal in equilibrium The charge per lattice ion is greatly reduced from e to e/N because 
the net charge e of the medium is shared equally by N ions. Since N is a large number, the selected 
electron moves in an extremely weak effective lattice potential y eff as characterized by the model 

Hamiltonian: 


p 2 

(3 50) ^ = 2 tit "t" = t^g£f(r). 

In other words any chosen electron moves in an enviro nme nt far different from what is represented by 
the bare lattice potential y. It moves almost freely in an extremely weak effective lattice potential y eff . 

This picture was obtained with the aide of Bloch’s theorem, and hence it is a result of quantum theory. 
To illustrate let us examine the same system from the classical point of view. In equilibrium the 
classical electron distribution is lattice periodic, so there is one electron near each ion. The electron 
will not move in the greatly reduced field. 

We now assume that electrons move independently in the effective potential field y eff . The total 
Hamiltonian for the idealized system may then be represented by 




,, , n «■= ■- E pj > - E sr+E v ^j> ■ 
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This Hamiltonian ^ c is a far better approximation to the original Hamiltonian than the 
Hamiltonian ^ B . In both interelectronic and interionic Coulomb repulsion are not neglected but are 

taken into consideration self-consistently. This theoretical model is a one-electron-picture 
approximation, but it is hard to improve on by any simple method. The model in fact forms the basis 
for the band theory of electrons. 

We now apply Bloch’s theorem to the Hamiltonian w c composed of the kinetic energy and the 








interaction energy y eff . We then obtain the Block energy bands Sj (hk) and the Bloch states 

characterized by band index j and k-vector k. The Fermi-Dirac statistics obeyed by the electrons can 
be applied to the Bloch electrons with no regard to interaction. This means that there is a certain 
Fermi energy s F for the ground state of the system Thus, there is a sharp Fermi surface represented by 

(3.52) = £ f, 

which separates the electron-filled k-space (low-energy side) from the empty k-space (high-energy 
side). The Fermi surface for a real metal in general is complicated in contrast to the free-electron 
Fermi sphere represented by 

p 1 _ pI+j$ + pI 

(3.53) 2m 2m ' 

The independent electron model with a sharp Fermi surface at 0 K is called the Fermi liquid model of 
Landau [3]. As we show later, many thermal properties of conductors are dominated by those 
electrons near the Fermi surface. The shape of the Fermi surface is very important for the conduction 
behavior. In the following section, we shall examine the Fermi surfaces of some metals. 

The Fermi liquid model was obtained in the static lattice approximation in which the motion of the 
ions is neglected. If the effect of moving ions (phonons: quanta of lattice vibrations) is taken into 
account, a new model is required. The electron-phonon interaction turns out to be very important in 
the theory of superconductivity, which will be discussed in Chapter 7. 


3.5 The Fermi Surface 

Why does a particular metal exist in a particular crystalline state? This is a good question. The 
answer must involve the composition and nature of the atoms constituting the metal and the interaction 
between the component particles. To illustrate let us take Na, which forms a bcc lattice. This 
monovalent metal may be thought of as an ideal composite system of electrons and ions. The system 
Hamiltonian may be approximated by in t3.48L which consists of the kinetic energies of electrons 

and ions and the Coulomb interaction energies among and between electrons and ions. This is an 
approximation since the interaction between electron and ion deviates significantly from the ideal 
Coulomb law at short distances because each ion has core electrons. At any rate the study of the 
ground state energy of the ideal model favors a fee lattice structure, which is not observed for this 
metal. If multivalent metals like Pb and Sn are considered, the condition becomes even more 
complicated, since the core electrons forming part of the ions have anisotropic charge distribution. 
Because of this complexity it is customary in solid state physics to assume the experimentally known 
lattice structures first, then proceed to study the Fermi surface. 

Once a lattice is selected, the Brillouin zone is fixed. For an orthorhombic (ore) lattice the 
Brillouin zone is a rectangular box defined by t3.43L We now assume a large periodic box of 
volume: 

(3.54) V = {NiflNNi&HNjc) , NtNiNj » 1 . 

Let us find the number N of the quantum states in the first Brillouin zone. With the neglect of the spin 
degeneracy, the number N is equal to the total k-space volume enclosed by the Brillouin boundary 
divided by unit k-cell volume: 
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(3.55) 

which equals the number of ions in the normalization volume. It is also equal to the number of 
conduction electrons in a monovalent metal. Thus, the first Brillouin zone can contain twice (because 
of spin degeneracy) the number of conduction electrons for the monovalent metal. This means that at 0 
K, half of the Brillouin zone may be filled by electrons. Something similar to this actually happens to 
alkali metals including Li, Na, K. These metals form bcc lattices. All experiments indicate that the 
Fermi surface is nearly spherical and entirely within the first Brillouin zone. The Fermi surface of Na 
is shown in Figure 3.6 . 


Figure 3.6 The Fermi surface of Na (bcc) is spherical within the first Brillouin zone. 



The nearly free electron model (NFEM) developed by Harrison [5] can predict a Fermi surface for 
any metal in the first approximation. This model is obtained by applying Heisenberg’s uncertainty 
principle and Pauli’s exclusion principle to a solid. Hence, it has a general applicability unhindered 
by the complications due to particle-particle interaction. Briefly in the NFEM, the first Brillouin zone 
is drawn for a chosen metal. Electrons are filled, starting from the center of the zone, with the 
assumption of a free electron dispersion relation. If we apply the NFEM to alkali metals, we simply 
obtain the Fermi sphere as shown in Figure 3.6 . 

Noble metals, including copper (Cu), silver (Ag), and gold (Au) are monovalent fee metals. The 
Brillouin zone and Fermi surface of Cu are shown in Figure 3.7 . The Fermi surface is far from 
spherical. Notice that the Fermi surface approaches the Brillouin boundary at right angles. This arises 
from the mirror symmetry possessed by the fee lattice. 


Figure 3.7 The Fermi surface of Cu (fee) bulges out in the (111) direction to make contact with the 
hexagonal zone faces. 













For a divalent metal like calcium (Ca) (fee), the first Brillouin zone can in principle contain all of 
the conduction electrons. However, the Fermi surface must approach the zone boundary at right 
angles, which distorts the ideal configuration considerably. Thus, the real Fermi surface for Ca has a 
set of unfilled corners in the first zone, and the overflow electrons are in the second zone. As a result 
Ca is a metal, and not an insulator. Besides Ca has electrons and holes. Divalent beryllium (Be) 
forms a hexagonal closed packed (hep) crystal. The Fermi surfaces in the second zone constructed in 
the NFEM and observed [7], are shown in Figure 3.8a and b . respectively. Let us now consider 
trivalent aluminum (Al), which forms a fee lattice. The first Brillouin zone is entirely filled with 
electrons. The second zone is half filled with electrons, starting with the zone boundary as shown in 
Figure 3.9 . For a more detailed description of the Fermi surface of metals, see standard texts on solid 
state physics [2, 6, 8]. Al and Be are superconductors, while Na and Cu are not. 

Figure 3.8 The Fermi surfaces in the second zone for Be. (a) NFEM “monster,” (b) measured 
“coronet.” The coronet encloses unoccupied states. 


(b) 

Figure 3.9 The Fermi surface constructed by Harrison’s model (NFEM) in the second zone for Al. 
The convex surface encloses vacant states. 



3.6 Heat Capacity and Density of States 

The band structures of conduction electrons are quite different from metal to metal. In spite of this, the 
electronic heat capacities at very low temperatures are all similar, which is shown in this section. We 
first show that any normal metal having a sharp Fermi surface has a T-linear heat capacity. We draw 
the density of states £i(e), and the Fermi distribution function f(e) as a function of the kinetic energy e 
in Figure 3.10 . The change in f(s) is appreciable only near the Fermi energy s F . The number of excited 

electrons, AN, is estimated by 
















Figure 3.10 The density of states in energy, p(e), and the Fermi distribution function f(s) are drawn as 
a function of the kinetic energy s. The change in f is appreciable only near the Fermi energy s F if k B T 

<sc s F . The shaded area represents approximately the number of thermally excited electrons. 
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(3.56) AN = E< eF )fc H r. 

Each thermally excited electron will move up with an extra energy of the order k B T. The approximate 
change in the total energy A E is given by multiplying these two factors: 

(3.57) AE = A Nk B T = V(£y)(i bT) 2 . 

Differentiating this with respect to T, we obtain an expression for the heat capacity: 

(3 58) Cv ~ 9T A£ = , 

which indicates the T-linear dependence. This T-dependence conies from the Fermi distribution 
function. Using the more rigorous calculation, we obtain 

(3 59) = ■ 

The density of states, i?(b f ), for any 3D normal metal can be evaluated from 



where the factor 2 is due to the spin degeneracy, and the surface integration is carried out over the 
Fermi surface represented by 

(3.61) £;(**> = £f. 

As an illustration, consider a free electron system having the Fermi sphere: 
pi + p ? + pI 

(3.62) £_ 2iS £f ‘ 

The gradient V s(p) at any point of the surface has a constant magnitude p F /m, and the surface integral 
is equal to 47ip F 2 . Equation f3.60T then, yields 
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As a second example, consider the ellipsoidal surface represented by 


n 2 n £ „2 

e = 2*- 4 - + 2Ll- 

(3.64) 2m! 2 m 2 2m 3 


After elementary calculations, we obtain (Problem 3.6.1) 


oea V{c) ' • 

which shows that the density of states still grows like s 1/2 , but the coefficient depends on the three 
effective masses (m,, m 2 , m 3 ). 

Problem 3.6.1. 

1. Compute the momentum-space volume between the surfaces represented by s = p x 2 /2m 1 + 
p y 2 /2m 2 + p z 2 /2m 3 and s + ds = p x ' 2 /2m 1 + p y ' 2 /2m 2 + p z ' 2 /2m 3 . By counting the number of 
momentum states in this volume in the bulk limit, obtain (3 j65). 

2. Derive (3j65), starting from the general formula 13.601 . Hint: Convert the integral over the 
ellipsoidal surface to one over a spherical surface. 


3.7 The Density of State in the Momentum 
Space 

In many applications of quantum statistical mechanics we meet the need for converting the sum over 
quantum states into an integral. This conversion becomes necessary when we first find discrete 
quantum states for a finite box, and then seek the sum over states in the bulk limit. The necessity of 
such a conversion does not arise in the spin problem This conversion is a welcome procedure 
because the resulting integral, in general, is easier to handle than the sum The conversion is of a 
purely mathematical nature, but it is an important step in carrying out statistical mechanical 
computations. 

Let us first examine a sum over momentum states p k = 27ihk/L corresponding to a one-dimensional 
motion. Let us take the sum 


13.661 T 

where A(p k ) is an arbitrary function of p. The discrete momentum states p k = 27thk/L are equally 

spaced, as shown by the short bars in Figure 3.11 . As the normalization length L is made greater, the 
spacing (distance) between two successive states, 27ih/L, becomes smaller. This means that the 
number of states par unit momentum interval increases as L increases. We denote the number of states 
within a small momentum interval A p by An. We now take the ratio, 


Figure 3.11 The linear momentum states are represented by short bars forming a linear lattice with 
unit spacing equal to 27ih/L. As the normalization length L is made greater, the spacing becomes 
smaller. 
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Aft _ number of states in Ap 

(3.67) Ap ~ Ap 

Dividing both the numerator and denominator by the number of states, we get 

An _ _1__ 1 _ I 

(3.68) Ap momentum spacing per state IfthfL 2?rfi 

Note that this ratio An/Ap increases linearly with the normalization length L. 

Let us now consider a quantity 

£ A MT“Aip , 

(3.69) T Ai * 

where Aj p is the 1-th interval and p, represents a typical value of p within the interval A, p, say the p- 
value at the midpoint of Aj p. Since 


= [Aft] = {number), 

the two sums (3.66) and (3.69) have the same dimension. Furthermore, their numerical values will be 
close if (a) the function A(p) is a smooth function of p, and (b) there exist many states in A\ p so that 

An/Aj p can be regarded as the density of states. Condition (b) is satisfied for the momentum states 
{p k } when the length L is made sufficiently large. We can then expect that in the bulk limit, by 
choosing infinitesimally small intervals : Aj p —► d p, expressions (3.66) and (3.69) will have the 
same value. That is, 


(3.70) Ai p 

as L —► oo. But by definition, the sum on the rhs becomes the integral J d p A(p)dn/d p, where 

dn Aft I 

(3.71) <h* Ap 2nh 

In summary, we therefore have 


oo 

dpA(p)— Aip as L-»oo. 

^„ , dp 

It is stressed that condition (a) depends on the nature of the function A. Therefore, if A(p) is 
singular at some point, condition (a) is not satisfied and this may invalidate the limit (3.72) . Such 
exceptional cases do occur. We further note that the density of state dn/dp = L/(27ih) does not depend 
on the momentum 

The sum-to-integral conversion, which we have discussed, can easily be generalized for a 


















multidimensional case. For example, in three dimensions, we have 

MVi) -*■ ( d i pA{p} J D(p) as V = l ?—► oo, 

(3.73) « j 

where 

(3.74) """I? 

is called the density of states per unit volume in momentum space. 

Let us choose a periodic cubic box of side length L for the normalization. The density of states 
dn/d 3 p can then be calculated by extending the arguments leading to (3 j 68). The result is given by 

1 L J 

(3.75) V{p) = (2*fe/L) 3 = ■ 

For electrons, the spin degeneracy doubles this density. We therefore obtain 
d n 2 V 

(3.76) w " d* “ (2*tf 

with spin degeneracy. 

Let us now use this result and simplify the normalization condition, which relates the chemical 
potential p with the average density n: 

^ » - y = - 

where (n K } gc denotes a grand canonical ensemble average of the number operator n K , resulting in the 
Fermi distribution function f(e K ).^ Note that k denotes the set of states and s K the single-electron 
energy associated with state k and the symbol “Lim” indicates the bulk limit. By choosing A(p) = 
f(p 2 /2m), we obtain 
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(3.78) (2^) J 

As a second example, we take the energy density of the system The total energy of the many-particle 
system is given by 

E = E - 

( 5.ly) k k 

where e K = p K 2 /(2m) is the kinetic energy of the electron with momentum p K . From f3.79L we obtain 

the energy density e = E/V: 

El 1 ^ 
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(3.80) 






Equations (3.79) and (3.78) were obtained starting with the momentum eigenvalues corresponding to 
the periodic boundary condition. The results in the bulk limit, however, do not depend on the choice 
of the boundary condition. 

The Fermi energy s F , by definition, is the chemical potential p 0 at absolute zero. We may look at 
this connection as follows. For a box of finite volume V, the momentum states are quantized. As the 
















volume V is increased, the unit cell volume in momentum space, (27ih) 3 /V decreases like W 1 . 
However, in the process of the bulk limit we must increase the number of electrons, N, in proportion 
to V Therefore, the radius of the Fermi sphere within which all momentum states are filled with 
electrons neither grows nor shrinks. Obviously, this configuration corresponds to the lowest energy 
state for the system The Fermi energy p 0 = p F 2 /(2m) represents the energy of an electron with the 
momentum magnitude p F at the surface of the Fermi sphere. If we attempt to add an extra electron to 
the Fermi sphere, we must bring in an electron with an energy equal to p 0 . 

Problem 3.7.1. The momentum eigenvalues for a particle in a rectangular box with sides of unequal 
length (fy, Ly, are given by 

2jiA 2jifi _ 2nh, 

P*j = -j—J , P Y ,k = -=—k , Pzj = ——l . 

L * Ly L z 

Assuming that the particles are fermions, verify that the Fermi energy s F is the same, independent of 
the box shape, in the bulk limit. 

3.8 Equations of Motion for a Bloch Electron 

In this section we discuss how conduction electrons respond to the applied electromagnetic fields. 
Let us recall that in the Fermi liquid model each electron in a crystal moves independently in an 
extremely weak lattice periodic effective potential y eff (r): 

(3.81) V e ff(r+ ft) = Veff(r) » V(r) . 

We write down the Schrodinger equation: 

V 2 -1- V{r)l #i(r) = evM r ) ■ 

(3.82) 1 .2 m 'Y 

According to Bloch’s theorem, the wavefunction v|/ satisfies 

(3.83) ^(r+«) = e ll ’ IE ^(r). 

The Bravais vector R can take on only discrete values, and its minimum length can equal the lattice 
constant a 0 . This generates a limitation on the domain in k. For example the values for each fyf a = x, 

y, z) for a sc lattice are limited to [—7r/a 0 ,7i/a 0 ]. This means that the Bloch electron’s wavelength X = 
27 i/k has a lower bound: 

(3.84) A>2 3o . 

The Bloch electron state is characterized by k-vector k, band index j and energy 
G.85I £ = £ j = Mp) ■ 

Here we defined the lattice momentum by p = hk. The energy-momentum (or dispersion) relation 
represented by f3.85I can be probed by transport measurements. A metal is perturbed from the 
equilibrium condition by an applied electric field; the deviations of the electron distribution from the 
equilibrium move in the crystal to reach and maintain a stationary state. The deviations, that is, the 
localized Bloch wavepackets, should extend over one unit cell or more. This is so because no 
wavepackets constructed from waves whose wavelengths have the lower bounds 2a 0 can be localized 

within distances less than a 0 . 




Dirac demonstrated [9] that for any p-dependence of the kinetic energy (s = e ( (p)) the center of a 

quantum wavepacket, identified as the position of the corresponding particle, moves in accordance 
with Hamilton’s equations of motion. Hence, the Bloch electron representing the wavepacket moves 
in the classical mechanical way under the action of the force averaged over the lattice constants. The 
lattice force -dy eff /dx averaged over a unit cell vanishes: 

00 

ff dydz [ dx^-Vesix^.z) = 0. 

(3 86) ' 6X /unit oall JJ J dX 

Thus, the only important forces acting on the Bloch electron are the electromagnetic forces. 

We now formulate dynamics for the Bloch electron as follows. First, from the quantum principle of 
wave-particle duality, we postulate that 

(3.87) = to*,Pi,Pz) = ipi.?2,pi) = p ■ 

Second, we introduce a model Hamiltonian, 

(3.881 P 2 , Pj) = . 

Third, we generalize our Hamiltonian ^ to include the electromagnetic interaction energy: 

(3.89)H = 7*o<|>-sA) + g0, 

where (A, cj)) are vector and scalar potentials generating electromagnetic interaction energy: 

(3 90) E = _v ^ r ’ - B = v x *) - 

where r = (x 1? x 2 , x 3 ). By using the standard procedures, we then obtain Hamilton’s equations of 
motion: 
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The first equation defines the velocity v = (v 1? v 2 , v 3 ). Notice that in the zero-field limit these 
equations are in agreement with the general definition of a group velocity: 

doi(k) 

(3.92) dh 

de{p) , ., . 

vi = - (particle picture}. 

(3.93) 9pi 

The first of (3.911 gives the velocity v as a function of p - qA. Inverting this functional relation, we 
have 


(3.941 p - gA = f[v). 

Using (3.91W3.94) . we obtain 
a25J i£-,(E + ,xS>. 

Since the vector f is a function of the velocity v, (3.951 describes how v changes by the action of the 
Lorentz force (the right-hand term). 

To see the nature of (3.95k we take a quadratic dispersion relation represented by 
c pi i £ i d i 

(3.961 

where {nij*} are effective masses and 8 0 is a constant. The effective masses {nij*} may be positive or 















negative. Depending on their values, the energy surface represented by (3.961 is ellipsoidal or 
hyperboloidal. See Figure 3.12 . If the Cartesian axes are taken along the major axes of the ellipsoid, 
(3.951 can be written as (Problem 3.8.1) 

Figure 3.12 (a) Ellipsoid, (b) hyperboloid of one sheet (neck), (c) hyperboloid of two sheets 
(inverted double caps). 



These are Newtonian equations of motion: mass x acceleration = force. Only a set of three effective 
masses {mj*} are introduced. The Bloch electron moves in an anisotropic environment if the effective 
masses are different. 

Let us now go back to the general case. The function f may be determined from the dispersion 
relation as follows: take a point A at the constant-energy surface represented by (3.851 in the k-space. 
We choose to point the positive normal vector in the direction in which energy increases. A normal 
curvature k is defined as the inverse of the radius of the contact circle at A (in the plane containing the 
normal vector) times the curvature sign S A : 

(3.98) * = - 

where S A is +1 or -1 according to whether the center of the contact circle is on the positive side 

(which contains the positive normal) or not. In space-surface theory [11], the two planes that contain 
the greatest and smallest normal curvatures are known to be mutually orthogonal. They are, by 
construction, orthogonal to the contact plane at A. Therefore, the intersections of these two planes and 
the contact plane can form a Cartesian set of orthogonal axes with the origin at A, called the principal 
axes of curvatures. By using this property, we define principal masses nij by 

(3.991 = ’ 

where 5pj is the differential along the principal axis i. If we choose a Cartesian coordinate system 
along the principal axes, (3.951 can be written as (Problem 3.8.2) 

d\>i 

Note that these equations are similar to those of (3.971 . The principal masses {mj}, however, are 
defined at each point on the constant-energy surface, and hence they depend on p and Sj(p). Let us take 
a simple example: an ellipsoidal constant-energy surface represented by (3.961 with all positive m^*. 
At extremal points, for example, (p l max , 0, 0) = {^2(£- £ 0 )^, 0 ,0), the principal exes of curvatures 
match the major axes of the ellipsoid. Then the principal masses {rrij} can simply be expressed in 
terms of the constant effective masses {m*} (Problem 3.8.3). 

























The proof of the equivalence between (3.951 and (3.100! is carried out as follows. Since fj are 
functions of (v 1? v 2 , v 3 ), we obtain 

d/i _ y> dfj dVj ^ yi / \ 1 dUj 

(3.101) i 0Vj dt j V d ft) 

The velocities Vj from (3.91) can be expressed in terms of the first p-derivatives. Thus, in the zero- 
field limit: 
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(3.1021 a /f s Pt d Pi - m a ’ 

which defines the symmetric mass tensor elements {my}. By using (3.1011 and (3.1021 . we can re¬ 
express (3.951 as 

Y' mjj-jr- = f{E + V x B)j, i = 1,2,3, 

(3.103) s 

which is valid in any Cartesian frame of reference. The mass tensor {my} is real and symmetric, and 
hence can always be diagonalized by a principal-axes transformation [11]. The principal masses {rr^} 
are given by (3.991 and the principal axes are given by the principal axes of curvature. 

In (3.991 the third principal mass m 3 is defined in terms of the second derivative, d 2 s/dp 3 2 , in the 
energy-increasing (p 3 -)direction. The first and second principal masses (m ls m 2 ) can be connected 
with the two principal radii of curvature, (P 1? P 2 ), which by definition equal the inverse principal 
curvatures (kappa 1? k 2 ) (Problem 3.8.4): 

1 v , , 1 
(3.1041 «j J V 11 Pj 3 

The equations in (3.1041 are very useful relations. The signs (definitely) and magnitudes 
(qualitatively) of the first two principal masses (m 1? m 2 ) can be obtained by a visual inspection of the 

constant-energy surface, an example of which is the Fermi surface. The sign of the third principal 
mass m 3 can also be obtained by inspection: the mass m 3 is positive or negative according to whether 

the center of the contact circle is on the negative or the positive side. For example, the system of free 
electrons has a spherical constant-energy surface represented by s = p 2 /(2m) with the normal vector 
pointing outward. By inspection the principal radii of curvatures at every point of the surface are 
negative, and the principal masses (m ls m 2 ) are positive and equal to m The third principal mass m 3 

is also positive and equal to m Equation (3.1001 was derived from the energy-k relation (3.851 
without referring to the Fermi energy. It is valid for all wave vectors k and all band indices j. 
Problem 3.8.1. Assume a quadratic dispersion relation (3.961 and derive (3.971 . 

Problem 3.8.2. Assume a general dispersion relation (3.881 and derive (3.1001 . 

Problem 3.8.3. Consider the ellipsoidal constant-energy surface represented by (3.961 with all rrij* > 

0. At the six extremal points, the principal axes of the curvatures match the major axes of the 
ellipsoidal. Demonstrate that the principal masses {mj} at one of these points can be expressed 

simply in terms of the effective masses {mj*}. 

Problem 3.8.4. Verify (3.1041 . Hint: Use a Taylor expansion. 
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1} The vector R is a translational vector, which expresses a lattice periodicity. 

2) We show briefly the derivation of the Fermi distribution function f(e K ). A detailed derivation is 
given in for example [13]. When the Hamiltonian of a system, is given by the sum of a single¬ 
particle Hamiltonian h, that is, n = Xj hj, the grand canonical ensemble average of a number 
operator n K then gives the Fermi distribution function f(s K ): 

s _ TR{n,e a ^ r -^ w } 


where the symbol “TR” denotes a many-body trace, n K> £ k ls the eigenvalue of h, (3 = 

l/(k B T), and a = (3p (pi: the chemical potential). 




Chapter 4 


Phonons and Electron-Phonon Interaction 


Phonons, electron-phonon interaction, and the phonon-exchange attraction are discussed in this 
chapter. 


4.1 Phonons and Lattice Dynamics 


In the present section, we review a general theory of heat capacity based on lattice dynamics. 

Let us take a crystal composed of N atoms. The potential energy y depends on the configuration of 
N atoms located at (r ls r 2 , • • •, r N ). We regard this energy y as a function of the displacements of the 

atoms, 


(4.1) “ r j - ’ 

measured from the equilibrium positions r^ 0 ). 
Let us Taylor expand the potential 

V = V(»1 , » 2 ,..., wi r , 1*12, 1*2*, ■ ■ ■) 

in terms of small displacements {u^}: 


V = V« + 


(4.2) 


j L J 

+ iEEEE w j> w ^[ 

i * k v L 


d 3 V 


dujpdutv 


L 


where all partial derivatives are evaluated at iq = u 2 = • • • = 0, which is indicated by subscript 0. We 
may set the constant y 0 equal to zero with no loss of rigor. By assumption, the lattice is stable at the 

equilibrium configuration. Then the potential y must be at a minimum, requiring that the first-order 
derivatives vanish: 


[Jq =o. 

(4.3) L^ W i> Jo 

For small oscillations we may keep terms of the second order in u J!a only. We then have 

(4.4) j .« i v 2 
where 



The prime (’) on y indicating the harmonic approximation, will be dropped hereafter. The kinetic 
energy of the system is given by 





(4.6) 


t'-Et^-Et^-EE? 


1 . 2 

2 m w j> 


The kinetic energy of all the atoms is the sum of their individual kinetic energies. 
We can now write down the Lagrangian £ = t - v as 


(4.7) 


^ — X] Z] X] X] E 7 AjfttvUjpUkr ■ 


The Lagrangian £ in the harmonic approximation is quadratic in u^ and ^ According to the theory of 

principal-axis transformation [1], we can in principle transform the Hamiltonian (total energy) j h = T 
+ v for the system into the sum of the energies of the normal modes of oscillations: 

JN 

n = T]\(p? + a 2 K <£), 

( 48 ) H 2 

where (Q K , P K } are the normal coordinates and momenta, and co K are characteristic frequencies. We 
note that there are exactly 3 N normal modes. 

Let us first calculate the heat capacity by means of classical statistical mechanics. This Hamiltonian 
U is quadratic in canonical variables (Q K , P K ). Hence, the equipartition theorem holds. We multiply 

the average thermal energy k B T for each mode by the number of modes 3N and obtain 3N k B T for the 
average energy Differentiating this with respect to T, we obtain 3Nk B for the heat capacity, in 

agreement with Dulong-Petit’s law. It is interesting to observe that we obtained this result without 
knowing the actual distribution of normal-mode frequencies. The fact that there are 3 N normal modes 
played an important role. 

Let us now use quantum theory and calculate the heat capacity based on formula 14.8) . The energy 
eigenvalues of the Hamiltonian ^ are given by 

£({M) = {\ + ► ft, = o, 1,2. 

We can interpret (491 in terms of phonons as follows: the energy of the lattice vibrations is 
characterized by the set of numbers of phonons {n K } in the normal modes {k} . Taking the canonical- 

ensemble average of (49), we obtain 

(4 10 ) {E{in})) = + {nx) ) = G + /o ^ ) ) 33 E{T) ’ 

where 


(49) 


(4.11) ~~ exp(e/fr H T)- 1 

is the Planck distribution function. 

The normal-mode frequencies {o\} depend on the normalization volume V, and they are densely 

populated for large V. In the bulk limit, we may convert the sum over the normal modes into a 
frequency integral and obtain 


(4.12) 


(4.13) 


E(T) = Eo 4- J, 

o 

« 

E 0 = j J p 








where £,(co) is the density of states (DOS) in angular frequency defined such that the 

(4.14) number of modes in the interval at 4- dtu) = . 

The constant E 0 represents a temperature-independent zero-point energy. 

Differentiating E(T) with respect to T, we obtain for the heat capacity: 

C ¥ , (. 

(4.15) i aT 

This expression was obtained under the harmonic approximation only, which is good at very low 
temperatures. 

To proceed further, we have to know the density of normal modes, p(co). To find the set of 
characteristic frequencies (co K | requires solving an algebraic equation of 3 Nth order, and we need 

the frequency distribution for large N. This is not a simple matter. In fact, a branch of mathematical 
physics whose principal aim is to find the frequency distribution, is called lattice dynamics. Figure 
4.1 represents a result obtained by Walker [2] after an analysis of the X-ray scattering data for 
aluminum (Al), based on lattice dynamics. Some remarkable features of the curve are: 

Figure 4.1 The density of normal modes p(oi) in the angular frequency G) for aluminum The solid 
curve represents the data deduced from X-ray scattering measurements due to Walker [2]. The broken 
lines indicate the Debye distribution with © D = 328 K. 



1. At low frequencies, 

(4.16) ■ 

2. There exists a maximum frequency co max such that 

(4.17) £*((«} = o for a > vj mai: ■ 

3. A few sharp peaks exist below ro max . 

Feature (1) is common to all crystals. The low-frequency modes can be described adequately in 
terms of longitudinal and transverse elastic waves. This region can be represented very well by the 
Debye continuum model [3], indicated by the broken line. Feature (2) is connected with the lattice 
structure. Briefly, no normal modes of extreme short wavelengths (extreme high frequencies) exist. 
There is a limit frequency co max . Sharp peaks were first predicted by van Hove [4] on topological 

grounds. These peaks are often referred to as van Hove singularities [5, 6]. 








The van Hove singularities, the jumps in the derivative of the density of states, occur in two and 
three dimensions when the constant-frequency plane touches the Brillouin boundary and undergoes a 
curvature inversion. Similar singularities occur for conduction electrons, too. We give a full account 
of the van Hove singularities in the following Section 4.2. 

As we will see later, the cause of superconductivity lies in the electron-phonon interaction [7]. The 
microscopic theory, however, can be formulated in terms of the generalized BCS Hamiltonian [7], see 
Section 7.3, where all phonon variables are eliminated. In this sense the details of lattice dynamics 
are secondary to our main concern. The following point, however, is noteworthy. All lattice 
dynamical calculations start with the assumption of a real crystal lattice. For example, to treat 
aluminum, we start with a face-centered cubic (fee) lattice having empirically known lattice 
constants. The equations of motion for a set of ions are solved under the assumption of a periodic 
lattice-box boundary condition. Thus, the k-vectors used in both lattice dynamics and Bloch electron 
dynamics are the same. The domain of the k-vectors can be restricted to the same first Brillouin zone. 
Colloquially speaking, phonons (bosons) and electrons (fermions) live together in the same Brillouin 
zone, which is equivalent to saying that electrons and phonons share the same house (crystal lattice). 
This affinity between electrons and phonons makes the conservation of momentum in the electron- 
phonon interaction physically meaningful. Thus, the fact that the electron-phonon interaction is the 
cause of superconductivity is not accidental. 

4.2 Van Hove Singularities 

We define the van Hove singularities as jumps in the derivative of the density of states. The property 
arising from the reflection symmetry that the constant-frequency plane touches the Brillouin boundary 
at right angles [15] must be used to complete discussion of the van Hove singularities in 2D and 3D. 

In Section 4.2.1, the normal modes of N coupled harmonic oscillators are obtained and discussed. 
Low-frequency phonons in simple-cubic (sc), body-centered cubic (bcc), tetragonal (tet), and fee 
lattices are analyzed in Section 4.2.2. Van Hove singularities and others are discussed in Section 
4.2.3. 

4.2.1 Particles on a Stretched String (Coupled Harmonic 
Oscillators) 

Let us take a system of N particles, each with mass m, separated by the equilibrium distance 1 on a 
stretched string with a tension x. We consider small vertical displacements yj(t). We assume a 

periodic boundary condition: 

(4.18) rj+w<«)= yjftL j = i,2,...,w. 

The kinetic energy 7 - is given by 

(4.19) r= T^‘ + ^ + "' + ^ ’ 

The potential energy y is given by 

(4 20) V = ll ~~ 4 ‘ + + fri _ > 



which is invariant under cyclic permutation. The Lagrangian £ is then 
c = T-v 

N 


= y _ 

(4.21) h^ 2 J 21 

The equations of motion are 

(4.2D m * J = I [ yj ' +1 “ 2yi + If- 1 ! 
We assume solutions of the form 


(yj+i-yj) 2 ] 


( 423 ) = ex Pt—- J Wfl, 

where k is a k-number. From the periodic boundary condition (4.18) this k-number satisfies 

(4.24) exp(i#J) = exp{it(j -I- N}t] for any j{< N) t 

or 


(4.25) exp(ifcM) = 1 

whose solution is 


fc = (^l) n ~ ** ’ n = ■ 


(4.26) 

Note that k has a dimension of an inverse distance. Substituting (4.23) into (4.22) . we obtain 

at 2 exp[—i{tot — Jfcjt)] = —fy[exp(ifcl) 4- exp{—ifci) — 2] 

(4.27) x - t j t)]. 


or 


to 2 = [1 — cos(fc J}1 = 4tu» sin 2 f^ , 

f4.28f «i L ,J 0 

where ro 0 = is the (positive) frequency of a single harmonic oscillator. By convention, we 

choose a positive angular frequency co: 

(4.29) m > o. 

We then obtain a dispersion relation: 


to = 2(i>o 


sin (f)| 


(430) 

A general solution is composed of 


(4.31) exp(—L©»t} exp{ifc»x), 

where 1^ and co n are respectively given by 
(432 )‘-(£)"- — 2 -h(^)|- 

These solutions characterized by (k^ co n ) represent waves propagating along the line with the phase 
velocity 

(4.33) c ” “ K ' 

We take first the case of an even number of particles, N. Then, N/2 is an integer. The number of 
degrees of freedom for the system is N. Then, there exist only Nk^ The frequency ro n is an increasing 

function of PqJ. We may choose an equal number, N/2, of positive +k n states and negative -k n states. 












The Lagrangian £ can be written as 

(4.34) 2 

where Q kn are the normal coordinates and co n normal mode frequencies. 

We introduce canonical momenta 
„ BC . 

(4.35) km ~ *Qk. ~ ^ 

and construct a Hamiltonian % 

«*iE( p £+”'2ei.) • 

(4.36) 2 ».) 

The frequencies {co n } are distributed, following the dispersion relation 14.321 . 
In the low-frequency (energy) region we obtain from (4.301 

(4.37) <o„ = to 0 l\k n \ For small |t»|. 

This relation is relevant to the low-energy (acoustic) phonons in a solid. 


4.2.2 Low-Frequency Phonons 

1. Simple Cubic Lattice: Let us consider small oscillations for a system of atoms forming a sc 
lattice. We note that polonium (Po) forms a sc lattice [16]. We use a harmonic approximation. 
Assume a longitudinal traveling wave along a cubic axis (x-axis). Imagine hypothetical planes 
perpendicular to the x-axis containing atoms forming a square lattice. This plane has a mass per 
unit square of side-length a (the lattice constant), equal to the atomic mass m The plane is 
subjected to a restoring force per cm 2 equal to Young modulus Y. The dynamics of a set of the 
parallel planes are similar to that of the coupled harmonic oscillators discussed in Section 4.2.1. 
Assume next a transverse wave traveling along the x-axis. The hypothetical planes containing 
the atoms are subjected to a restoring stress equal to the rigidity (shear) modulus S. The 
dynamics is also similar to the coupled harmonic oscillators in one dimension (ID). 
Low-frequency phonons are those to which Debye’s continuum solid model applies. The wave 
equations are 

fPwt J 2 

(4.38) = WJ ' 

where i = 1 (longitudinal) or t (transverse). The longitudinal-wave phase velocity C| is 

[y 

(4.39) C1 V p ’ 

where p is the mass density. The transverse-wave phase velocity c t is 

fs 

(4.40) Ct \p' 

where S is the shear modulus. The waves are superposable. Hence, the movement of phonons 
is not restricted to the crystal’s cubic directions. In short there is a k-vector, k: 

(4.41) ^ = k*®* + ^ r e r + k x e x , 





where e x , e y , e z are the unit vectors in the Cartesian coordinates. The wave propagation is 
isotropic. 

For the transverse waves the polarization vector points in a direction in the 2D polarization 
plane perpendicular to the k-vector. Hence, there are twice as many normal modes as those 
modes for the longitudinal waves. 

2. Body-Centered Cubic Lattice: Alkali metals like sodium (Na) and potassium (K) form bee 
lattices. The bee lattice can be decomposed into two sublattices, one containing the corner atoms 
forming a sc lattice and the other containing body-center atoms also forming a sc lattice. These sc 
sublattices are similar with the same lattice constant a and the same crystal directions. The low- 
frequency phonons in a bee lattice can travel in the same manner as the low-frequency phonons in 
the associated sc sublattice. In short they travel with a linear dispersion relation and 
isotropically. 

3. Tetragonal Lattice: Indium (In) forms a tet lattice (a 1 = a 2 ^ a 3 , a = P = y = 90°). Low-energy 
phonons may travel along one of the crystal axes (x-axis). We may introduce a set of hypothetical 
planes containing atoms. The plane has a mass density per cm 2 and it is subjected to a restoring 
stress. The dynamics of the plane motion is similar to the coupled harmonic oscillators discussed 
earlier. The mass density and the stress depend on the crystal axis along which the acoustic (low- 
energy) phonons travel. The elastic waves travel with linear dispersion relations. The wave 
propagation is angle-dependent or anisotropic, however, because the mass density and the 
restoring stress depend on the directions. 

4. Face-Centered Cubic Lattice: Copper (Cu) and A1 form fee lattices. The fee lattice can be 
decomposed into three sublattices, one containing the eight corner atoms forming a sc lattice with 
the side length a, a second containing the top and bottom face-center atoms forming a sc lattice 
with the side length a, a third containing the side face-center atoms forming a tetragonal lattice 
with the side-lengths (a/,/ 2 , a/,/ 2 , a). The unit cell of a tetragonal sublattice contains two atoms. 
The acoustic longitudinal, transverse phonons associated with the sc-sublattices travel 
isotropically and the phonons associated with the tetragonal sublattice travel anisotropically. 
They have different phase speeds. 

4.2.3 Discussion 

Let us first take a sc lattice with the lattice constant a. We choose Cartesian coordinates (x, y, z) along 
the cubic axes with the center O at a unit cell center. The Brillouin boundary is cubic with the side- 
length 27t/a. The original lattice has a reflection symmetry with respect to the plane: x = 0. It also has 
the same symmetry with respect to the y = 0-plane and the z = 0-plane. The low-frequency 
longitudinal and transverse phonons have linear dispersion relations. Hence, the density of states has 
a quadratic -03 dependence at low frequencies. 

Second, we consider a bee lattice. Since the bee lattice can be decomposed into two similar sc 
sublattices with the lattice constant a and with the same cubic lattice directions, the density of states 
has a quadratic-G) dependence. The phonons for both sc and bee lattices run isotropically. Hence, the 
constant-frequency surface at low frequencies is a sphere. 

Third, we consider a tet lattice. The low-frequency dispersion relation is linear. The density of 



states has a co-quadratic behavior. The phonon proceeds anisotropically, however. 

Fourth, a fee lattice is considered. A fee lattice can be decomposed into two sc sublattices of the 
lattice constant a and a tetragonal sublattice of side-lengths (a/,/ 2 , a/,/ 2 , a). The low-frequency 
dispersion relations for the longitudinal and transverse phonons are all linear. The density of states 
has a co-quadratic behavior. The phonons associated with the tetragonal sublattice are anisotropic 
while the phonons associated with the sc lattice is isotropic. We note that such detailed phonon-mode 
behaviors are not available in Debye’s continuum model theory [3]. 

In a historic paper [17] Einstein resolved the mystery concerning the near-absence of the lattice 
heat capacity of a diamond at room temperature. In his model each atom oscillates harmonically with 
the angular frequency o) 0 . There is no frequency distribution. In our lattice-plane model developed in 

Section 4.2.2, the single-atom frequency co 0 may be identified as 

(4.42) ojo = Einstein's model frequency . 

This G) 0 can be regarded as the average frequency: 

(4.43) "0 = M ■ 

This view is supported by the fact that the Einstein temperature T E is a little lower than the Debye 
temperature T D : 

(4.44) fe-£U0 < frB?D . 

For diamond, experiments indicate that 

(4.45) Te = 1320 K , Tu = I860 K . 

According to 14.281 . the angular frequency cd approaches 2co 0 : 

at highest |k N |l/2 = 7i/2. 

The density of states in ID drops to zero at the maximum frequency. Aluminum forms a fee lattice. 
The fee lattice can be decomposed into two sc sublattices and one tet sublattice. Since the shear 
modulus S is smaller than the Young modulus Y, the transverse phonons have lower energies than the 
longitudinal phonons for the same k-values. The phonons associated with the sc sublattices are more 
easily excited than those associated with the tet sublattice since the restoring stresses are greater in 
the latter (more compact unit cell). Thus, the lowest-energy normal modes come from the transverse 
phonons associated with the sc sublattices. We may then conclude that the first peak in Figure 4.1 
arises from the transverse sc phonons. The rise is greater than the Debye continuum model curve 
(dotted line). This is in line with the co-k (dispersion) relation in (432), which has a negative second 
derivative for increasing k. The sharp peak arises as follows: for very small k, the dispersion relation 
is linear and the constant-frequency plane is a sphere. As the frequency (or energy) increases, the 
constant-frequency plane grows in size and, must approach and touch the six face-centers (singular 
points) of the cubic Brillouin zone boundary. The plane must approach the zone boundary at right 
angles because of the reflection symmetry possessed by the fee lattice [15]. As the frequency 
increases, the constant-frequency plane grows in size, touches the boundary, changes its curvature 
sign, and eventually fills the whole k-space. The 2D cut of the surface development at z = 0 is 
illustrated in Figure 4.2 . The small circle in Figure 4.2a represents the cut of a constant-frequency 


(4.46) 


oj = 2 iOQ sin 







plane for small k. Between Figure 4.2b and 4.2c the curvature sign changes from convex to concave. 
We will show that the change point is the point of the van Hove singularity [6]. 


Figure 4.2 2D intersections at = 0 of the constant frequency planes for a simple cubic lattice in the 

©-increasing order: (a-d). The shaded areas correspond to the lowest-frequency (energy) sides. 
There is a singular point between (b,c) where the curvature sign changes. 
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The density of phonon states in frequency per unit phase space, £>(<»), can be written as 
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14.471 

where the surface integral is over the surface S on which 

(4.48) ®4t) = ffi (= constant) 

and in three dimensions: 


9k 
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The density is high at the contact where 


(4.49) 


5ct> 
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= 0, 


|3(u/8Jt| 


= oo 


But there are only six contact points, which have zero measure in the integral. They contribute nothing 
to the integral (AA1) . The constant-frequency plane near the contact (singular points) is far from 
spherical. A further increase in co generates a collection of singularities forming small circles on the 
zone boundary, which grows in size. The normal modes eventually fill up all the k-space and the 
density of states vanishes thereafter. In the range up to the curvature inversion point ccq the density of 

the states i?(g)) increases with positive first and second derivatives: 
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After the inversion point oeq, we have 
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< 0 p it) > iti 1 , 
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The first region co < oeq in t4.501 corresponds to the case of the growing normal-mode-filled k-space 
(shaded): Figure 4.2a —► b while the second case to > coj in 14.511 to the shrinking empty k-space: 

Figure 4.2c —► d. The density of states D(co) has a maximum, and is continuous at the curvature- 
inversion point coj but its first derivative has a jump. The curvature inversion must exist since the 

total number of normal modes is precisely equal to the number of degrees of the system, and it must 
be contained within the Brillouin zone boundary. The curvature-inversion point is unstable, and can 
be reached smoothly neither from the low-frequency side nor the high-frequency side. There are 
unavoidable fluctuations and dispersions at this point, which may be probed by dynamic transport 
measurements. 

For a ID (linear) crystal, the frequency co is given by 2co 0 | sin(kl/2)|. By direct calculation we 
obtain 

1/2 


1 V 2 n 


— 


(4.52) 

Because of the inversion symmetry, the constant-frequency lines approach the Brillouin zone 
boundary at right angles. There are no curvatures and no curvature inversions. The density of states 
diverges at the end points (co = ±2co 0 ). This singularity can be regarded as a precursor to the van 

Hove singularities occurring in 2D and 3D. 

Essentially the same thing happens for the phonons associated with the tet sublattice. This explains 
the second peak in Figure 4.1 . There are side-shoulders for the first and second peaks on the high- 
frequency sides. These are due to the longitudinal phonons. 

In summary the van Hove singularities (peaks) occur when the constant-frequency plane touches the 
Brillouin zone boundary and undergoes a curvature inversion. There are two prominent peaks in A1 
(fee) due to the transverse phonons associated with the sc sublattices and the tet sublattice. 


4.3 Electron-Phonon Interaction 

A crystal lattice is composed of a regular array of ions. If the ions move, then the electrons must move 
in a changing potential field. Frohlich proposed an interaction Hamiltonian, which is especially 
suitable for transport and superconductivity problems. In the present section we derive the Frohlich 
Hamiltonian [8, 9]. 

For simplicity let us takes a sc lattice. The normal modes of oscillations for a solid are longitudinal 
and transverse running waves characterized by wave vector q and frequency ro q . First, consider the 

case of a longitudinal wave proceeding in the crystal axis x, which is represented by 

(4.531 u 9 exp{— itVqt 4- iq ■ r) = if, exp(— i<o q t 4- iqx), 

where u q is the displacement in the x-direction. The wavelength X = 27t/q(< 2a 0 ) is greater than twice 
the lattice constant a 0 . The case, X = 12a 0 , is shown in Figure 4.3 . 














Figure 4.3 Longitudinal waves proceeding in the x-direction: A, = 12a 0 . Circle (V) indicates a lattice 
ion. 
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If we imagine a set of parallel plates containing a great number of ions fixed in each plate, we have 
a realistic picture of the lattice vibration mode. From Figure 4.3 we see that the density of ions 
changes in the x-direction. Hence, the longitudinal wave modes are also called the density-wave 
modes. The transverse wave modes can also be pictured from Figure 4.3 by imagining a set of 
parallel plates containing a great number of ions fixed in each plate and assuming the transverse 
displacements of the plates. Notice that this mode generates no charge-density variation. 

Now, the Fermi velocity v F in a typical metal is of the order 10 6 m s -1 while the speed of sound is 

of the order 10 3 ms -1 . The electrons are then likely to move quickly to negate any electric field 
generated by the density variations associated with the lattice wave. Hence, the electrons may follow 
the lattice waves quite easily. Given a traveling normal mode in (4.53L we may assume an electron 
density deviation of the form: 

14.541 c 9 ■ r ) ■ 

Since electrons follow phonons immediately for all co q , the factor C q can be regarded as independent 
of (D q . We further assume that the deviation is linear in q • u q = qu q and again in the electron density 
n(r). Thus, 

14.551 = A 9 qu 9 n{r) . 

This is called the deformation potential approximation [18]. The dynamic response factor A q is 

necessarily complex since there is a time delay between the field (cause) and the density variation 
(result). The traveling wave is represented by the exponential form 64.541 . Complex conjugation of 
this equation yields C q * exp(ioo q t- iq • r). Using this form we can reformulate the electron’s response, 
but the physics must be the same. From this consideration we obtain (Problem 4.3.1) 

14.561 A « = A V 

The classical displacement u q changes, following the harmonic equation of motion: 

(4.57) + “W = 0 ■ 

Let us write the corresponding Hamiltonian for each mode as 
(4 58) H = i(p z + tu 2 g 2 ). q = p = q, (o q = oi, 

where we dropped the mode index q. If we assume the same quantum Hamiltonian 'h and the basic 
commutation relations: 

(4.59) [«,p] = tfL [«.«] = Lp. p] = o. 

then the quantum description of a harmonic oscillator is complete. The equations of motion are 
(Problem 4.3.2) 










(4.601 »~a L » ,rtJ ~ y ’ ^ = ^'^ = --9- 
We introduce the dimensionless complex dynamical variables: 

(4.61) flt = (2 hd))~ lfl (p + tog), a = (2hio)~ V2 {p - Ut^). 

Using (4.601 we obtain 

(4.62) = {2ha))~ 1/z {—a) 2 q 4- i*>p) — i ima^ , a — — rna . 

We can express (q, p) in terms of (a, af): 

/*V'fy + t 4 

(4.63) ! V*®) P = (-) (a + °’- 

Thus, we may work entirely in terms of (a, af). After straightforward calculations we obtain 
(Problem 4.3.3): 


a = _qj -|- icoq)[p — i<oq) 

jLt 

= jlP 2 + &V + ia»{gp - pg)] = U - . 


(4.641 


(4.65) w + ■ 


(4M1 


aa 


t- = [ H; H t]= 1 t 


and we finally obtain from (4.641-(4.661 

7i — -I- aa 1 ’) = fta* 4- ^ 

= ftai { n 4- — ^ , n = aU 

(4.67) V V 


The operators (a f , a) satisfy the Base commutation rules (4.661 . The energy-eigenvalue equation for 
ji is now reduced to finding the eigenvalues and eigenstates of the Hermitian operator 

(4.68) n — (da = rd . 

The eigenvalue equation is 


(4.691 "K) = ">') - 

where ft and |ft) are eigenvalues and eigenkets, respectively. 

We shall show that n has as eigenvalues all nonnegative integers. Multiplying (4.691 by the eigenbra 
(ft| from the left, we obtain 
(4.701 {»VaK) - b»') . 

Now, (ft|af a|ft) is the squared length of the ket a|ft) and hence 
(4.711 {ny«K)>o. 

Also, by definitional ft) > 0. Hence from (4.701 and (4.711 . we obtain 
(4.721 ft > Q ; 

the case of equality occurring only if 


(4.73) = 0 ■ 

Consider now [a, n] = [a, a^ a]. We may use the following identities: 

(4.74) [A, BC] = B[A, C] + [A, B]C , [AB, C] = A[B, C ] + [A, C]B 
and obtain 
















(4.75) [ fl , °] + [a, a t ]o = a or an — na = a . 


Rearranging terms in (4.75) and multiplying it by the eigenket | n) from the right, we obtain 

(4.76) na \ n> ) = {an-a)\n') = {n* -\)a\n f ) . 

Now, if a|n) ^ 0, then a|n'j is, according to (4.76) . an eigenket of n belonging to the eigenvalue ri - 
1. Hence, for nonzero n’, n' - 1 is another eigenvalue. We can repeat the argument and deduce that, if 
n' - 1 ^ 0, n' - 2 is another eigenvalue of n. Continuing in this way, we obtain a series of eigenvalues 
n’, n' - 1, n' - 2, • • • that can terminate only with the value 0 because of inequality (4.72) . By a similar 
process, we can show from the Hermitian conjugate of (4.75) : 

(4.77) na* — a*n = a* 

that the eigenvalue of n has no upper limit (Problem 4.3.4). Hence, the eigenvalues of n are 
nonnegative integers: 

(4.78) = o,i, 2 , .. 

The energy-eigenvalue equation for the harmonic oscillator is 

(4.79) K\ n ) = - 

where the energy E is given by 

»-o,i,2,. 

Here we omitted the primes for the eigenvalues since they are self-evident. In summary we obtain 

• Eigenvalues of n = a| a: n' = 0, 1,2, ... 

• Vacuum ket |<|>): a|<|>) = 0 

• Eigenkets ofn: |(<J)j,, aV)> (a^) 2 |c()), ... having the eigenvalues 0, 1, 2,... 

• Eigenvalues of ^: E = (n + i) hco 

The main result (4.80) can be obtained by solving the Schrodinger equation: 



which requires considerable mathematical skills. 

The ket |nj can be expressed in terms of (a, a'). Let |cj>o> be a normalized eigenket of n belonging to 
the eigenvalue 0 so that 

(4.82) = hHM = o. 

This ket is called the vacuum ket. It has the following property: 

(4.83) fl l0o) = 0 ■ 

Using the Bose commutation rules (4.66) we obtain a relation (Problem 4.3.5): 

(4.84) *{*Y- = n'Uh"'- 1 , 

which may be proved by induction. Multiply (4.84) by a^ from the left and operate the result to |® 0 ). 
Using (4.83) we obtain 

(4.85) "(4)"Vo) = "'(ab^Vo) ’ 

indicating that (a^*) n j(|) 0 ) is an eigenket belonging to the eigenvalue n'. The square length of (a t ) n ’|([) 0 ) is 

(4.86) {^|fl n VrV»> = =-- = n f i. 

We see from (4.85) that a|n') is an eigenket of n belonging to the eigenvalue n' - 1. Similarly, we can 

















show from [a, a 1 ] = a^ that a^|n'} is an eigenket of n belonging to the eigenvalue n' + 1. Thus, operator 
a, acting on the number eigenket, annihilates a particle, while operator a’ creates a particle. 
Therefore, a and a^ are called annihilation and creation operators, respectively. 

In summary, the quantum Hamiltonian and the quantum states of a harmonic oscillator can be simply 
described in terms of the bosonic second-quantized operators (a, a^"). 

We now go back to the case of the lattice normal modes. Each normal mode corresponds to a 
harmonic oscillator characterized by (q, co q ). The displacements u q can be expressed as 

where (a q , a q ^) are annihilation and creation operators satisfying the Bose commutation rules: 

(4.88) K fl '] “ a * a * ~ a * a * = ’ [B * ,0p] = K = 0 ■ 

We can express the electron density (field) by 

(4.89) = 

where \|/(r) and \|/^(r) are annihilation and creation electron field operators, respectively, satisfying 
the following Fermi anticommutation rules: 

V>V}[ = V'WV) -I- **) , 

(4.90) {tU(r), Air')} = 0 ■ 

The field operators v|/(\|/^) can be expanded in terms of the momentum-state electron operators 

£491) 

where operators (c, c^) satisfy the Fermi anticommutation rules: 

(4 92) \ l ' t ’ = c A k = = o 

Let us now construct an interaction Hamiltonian ^ F , which has the dimension of an energy and 
which is Hermitian. We propose 

?* F = /d 3 rV[A g gw fl exp{iq r)V J+ (r)V’(r)-l-h.c.] , 

(4.93) J ■ 

where h.c. denotes the Hermitian conjugate. Using f4.55k f4.87k and f4.91I we can express f4.93f in 
a second-quantized form (Problem 4.3.6): 

//i n/i\ = + K c I c h-« h I) - = lA i s ■ 

f4.94f tq 

This is the Frohlich Hamiltonian. Electrons describable in terms of c k ’s are now coupled with 
phonons describable in terms of a q ’s. The term 

V A+q C t*q 

can be pictured as an interaction process in which a phonon is absorbed (emitted) by an electron as 
represented by the Feynman diagram [10, 11] in Figure 4.4a fbf Note: at each vertex the momentum is 
conserved. The Frohlich Hamiltonian is applicable for longitudinal phonons only. As noted 

earlier, the transverse lattice normal modes generate no charge density variations, making the 


y<r) = -j= exp(i* ■ r )c t , tffrr) = -j= exp(-i*' ■ r)c\ 











electron-transverse phonon interaction negligible. 

Figure 4.4 Feynman diagrams representing (a) absorption and (b) emission of a phonon (indicated by 
the dashed line) by an electron. They correspond to the interaction terms in 14.94) . 



(a) tb> 


Problem 4.3.1. Prove 14.56) . 

Problem 4.3.2. Verify 14.60) . 

Problem 4.3.3. Verify 14.64W4.66) . 

Problem 4.3.4. Verify that the eigenvalues of n have no upper limit. As shown in the text the 
eigenvalues are separated by unity and the lower bound is zero. Hence the eigenvalues n' are 0, 1,2, 

Problem 4.3.5. Prove 14.84) by mathematical induction. 

Problem 4.3.6. Show that the second-quantized form of is given by 14.94) . 


4.4 Phonon-Exchange Attraction 


By exchanging a phonon, two electrons can gain an attraction under a certain condition. In this section 
we treat this effect by using the many-body perturbation method [12, 13]. 

Let us consider an electron-phonon system characterized by 

H = ^ ^ -I- ^ + 2 ^ 

+A E E E + hc -) 

t i 1 


14.95) = + = = 

where the three sums represent respectively the total electron kinetic energy (^ el ), the total phonon 
energy (^ ph ), and the Frohlich interaction Hamiltonian (^ F ) (see 14.94) ). 


For comparison we consider an electron gas system characterized by the Hamiltonian 

% = EE s ‘ f U. + jE ,,, D 1 ' 2 i v c 44 =^ 1 +v c „ 

14,96) * s * 1*1 **** 


where 


{1 P 2|V C |3 P 4) = {*l5 lp * 2 J 2 |V C |* jS3 ,W4> 

1 . ft ft r 

^ = y ^2 6 *3 +*4.*1 +*2 0 Si ,*3 0 ■ 

The elementary interaction process can be represented by a diagram as in Figure 4.5 . The wavy 
horizontal line represents the instantaneous Coulomb interaction y c . The net momentum of a pair of 

electrons is conserved: 
















Figure 4.5 The Coulomb interaction represented by the horizontal wavy line generates the change in 
the momenta of two electrons. 



(4.98) *i-r* 2 = *j + *4, 

as seen by the appearance of the Kronecker delta in (4.97) . Physically, the Coulomb force between a 
pair of electrons is an internal force, and hence it cannot change the net momentum 
We wish to find an effective Hamiltonian y eff between a pair of electrons generated by a phonon 
exchange. If we look for this y eff in the second order in the coupling constant X, the likely candidates 

may be represented by the two Feynman diagrams^* in Figure 4.6 . Here, the time is measured upward. 
In the diagrams in Figure 4.6a.b . we follow the motion of two electrons. We may therefore consider a 
system of two electrons and obtain the effective Hamiltonian y eff through a study of the evolution of 

two-body density operator p 2 . For brevity we shall hereafter drop the subscript 2 on p indicating the 
two-body system 


Figure 4.6 (a,b) A one-phonon exchange process generates the change in the momenta of two 
electrons similar to that caused by the Coulomb interaction. 
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The system-density operator p(t) changes in time, following the quantum Liouville equation: 

(499) 

where the superscript (x) on ^ indicates the Liouville operator.^* We assume the Hamiltonian ^ in 
(4.96k and study the time evolution of p(t), using quantum many-body perturbation theory. We sketch 
only the important steps; more detailed calculations are given in Fujita and Godoy’s book [12, 13]. 

Let us introduce quantum Liouville operators^ 1 , 

(4.100) = +AV*, 

which generate a commutator upon acting on p, see (4.99) . We assume that the initial-density operator 
p 0 for the combined electron-phonon system can be written as 


(4.101) P® — Pelertriin ® Pphooan j 

which is reasonable at 0 K, where there are no real phonons and only virtual phonons are involved in 
the dynamical processes. We can then choose 












(4.102) Pphonon = |0)(0| r 

where |0) is the vacuum-state ket for phonons: 

(4.103) = 0 for any g. 

The phonon vacuum average will be denoted by an upper bar or angular brackets: 

(4.104) P(0 = (0|P(OI°> = • 

Using a time-dependent perturbation theory and taking a phonon-average, we obtain (Problem 4.4.1) 
from (4J99) 

dr{V x exp(—irA - )V X p(t — r)) lv . 

In the weak-coupling approximation, we may calculate the phonon-exchange effect to the lowest 
(second) order in X, so that we obtain (Problem 4.4.2) 

2 2 {V X exp(-lrfi _1 ?iJ : )V x p(i - r)) 
t4.106^ - 2 2 {V X exp(-irfc _1 7^)V x > av p(t - r) . 

In the Markovian approximation we may replace p(t - x) by p(t) and take the upper limit t of the x- 
integration to oo. Using these two approximations, we obtain from (4.105) 

1QT) - U 2 * -1 hm(V x (^o - ia) -1 V x )„?(*) - « > 0 . 

Let us now take momentum-state matrix elements of (4.107) . The lhs is 

9 3 

(4 108) 3f{ klSl ' k2, Sl MOl k 3Sj,fc4S4> = ^p(L2;3,4, t) r 

where we dropped the upper bar indicating the phonon average. The rhs requires more sophisticated 
computations since the Liouville operators (y x , ^ 0 X ) are involved. After lengthy but straightforward 
calculations, we obtain (Problem 4.4.3) from (4.107) 

—p(l,2;3.4,t) = J2 *) 

(4.109) - P(l- 2:5,6, t){5, G| Veffl 3,4)] , 

where (l,2|y eff |3,4) is given by 


3p(t) X 2 f 

dt ** ] 


(l,2|VdF|^4) = \V q \ 2 


tiojq 

fri - £s) 2 - ti 2 ct) 2 


(4. 1 10) X 1+^*1— *3><f ' 

The Kronecker delta 8 k +k in (4.110) means that the net momentum is conserved, since the 

phonon-exchange interaction is an internal interaction. 

For comparison, consider the electron-gas system characterized by the Hamiltonian in (4.96) . 
The two-electron density matrix %c for this system changes, following 


_ 3 _ 

9t 


Pc (1.2; 3,4, t) = ££-* 1 [{1.2|Vc|S, 6)pc(S, 6; 3,4, t) 


(4.111) -Pc(1.2;S,6J)(S p 6|Vc|3 p 4)] p 

which is of the same form as (4.109) . The only differences are in the interaction matrix elements. 

















Comparison between (4.971 and 4.110) yields (Problem 4.4.4) 

1 

<4.112) V 7 

ha> a 


(Coulomb interaction) 


h 2 a ) 2 


(phonon-exchange interaction). 


(4.113) ( c *i+9 £ *i) 2 

In our derivation, the weak-coupling and the Markovian approximation were used. The Markovian 
approximation is justified in the steady-state condition in which the effect of the duration of 
interaction can be neglected. The electron mass is four orders of magnitude smaller than the lattice- 
ion mass, and hence the coupling between the electron and ionic motion must be small by the mass 
mismatch. Thus, expression <4.1131 is highly accurate for the effective phonon-exchange interaction at 
0 K. This expression has remarkable features. First, it depends on the phonon energy hoo q . Second, it 

depends on the electron energy difference s ki+q - e k before and after the transition. Third, if 


(4.114) ®*il ’ 

the effective interaction is attractive. Fourth, the attraction is greatest when e ki+q - e k = 0, that is, 

when the phonon momentum q is parallel to the constant energy (Fermi) surface. A bound electron 
pair, called a Cooper pair, may be formed due to the phonon-exchange attraction as demonstrated by 
Cooper [14], which will be discussed in the following chapter. 

Problem 4.4.1. Derive (4.1051 . 

Problem 4.4.2. Derive (4.1061 . 

Problem 4.4.3. Derive (4.1091 along with (4.1101 . 

Problem 4.4.4. Derive the expressions (4.1121 and (4.1131 . 
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Chapter 5 


Electrical Conductivity of Multiwalled Nanotubes 


The electrical conductivity of carbon nanotubes varies, depending on the temperature, radius, and 
pitch of the sample. In the majority of cases, the resistance decreases with increasing temperature, 
indicating a thermally activated process. The standard band theory based on the Wigner-Seitz (WS) 
cell model predicts a gapless semiconductor, which does not account for the thermal activation. A 
new band model in which an “electron” (“hole”) has an orthogonal unit cell size for graphene is 
proposed. The normal charge carriers in graphene transport are “electrons” and “holes.” The 
“electron” (“hole”) wavepackets extend over the unit cell and carry the charges - e (+e). Thermally 
activated “electrons” or “holes” are shown to generate the observed temperature behavior of the 
conductivity in multiwalled nanotubes. 


5.1 Introduction 

In 1991 Iijima [1] discovered carbon nanotubes in the soot created in an electric discharge between 
two carbon electrodes. A nanotube (nt) can be considered as a single sheet of graphite, called 
graphene, that is rolled up into a tube. A single-wall nanotube (swnt) has a radius of 5-10 A while a 
multiwalled nanotube (mwnt) rolled like wallpaper has a size exceeding 10 nm (= 100 A) in radius 
and microns in length. Nanotubes have remarkable mechanical properties that can be exploited to 
strengthen materials. And since they are composed entirely of carbon, nanotubes are light and also 
have a low specific heat. Ebbesen et al. [2] measured the electrical conductivity of individual 
nanotubes. In the majority of cases, the resistance R decreases with increasing temperature T while 
the resistance R for a normal metal like copper (Cu) increases with T. This temperature behavior in 
nanotubes indicates a thermally activated process. Schonenberger and Forro [3] reviewed many 
aspects of carbon nanotubes; this review and [2] contain many important references. The current band 
theory based on the Wigner-Seitz (WS) cell model [4] predicts a gapless semiconductor for graphene 
and cannot explain the observed T-behavior. A new theory is required. The WS model is suited for 
the study of the ground state energy of a crystal [5]. To treat electron motion for a honeycomb lattice, 
we must introduce a different unit cell [5]. We present a new theoretical model for Bloch electron 
dynamics [6]. 

5.2 Graphene 

Following Ashcroft and Mermin [5], we adopt the semiclassical electron dynamics in solids. In the 
semiclassical (wavepacket) theory, it is necessary to introduce a k-vector: 

(5 \) k = k x e x + kyCy + k x e x , (e*, £ z : Cartesian unit vectors) 


since the k-vector is involved in the semiclassical equation of motion: 
dfc 

(5.2) hk " h ii = q[E + vxB) ’ 

where E and B are the electric and magnetic fields, respectively, and the vector 

i— 

( 5 . 3 ) v =h%k 

is the electron velocity where s is the energy. The choice of the Cartesian axes and the unit cell for a 
simple cubic (sc) crystal is obvious. We choose the center of the cube as the origin and take the x-, y-, 
and z-axis along the cube sides. Two-dimensional crystals such as graphene can also be treated 
similarly, with only the z-component being dropped. We must choose an orthogonal unit cell for the 
honeycomb lattice, as shown below. 

Graphene forms a 2D honeycomb lattice. The WS unit cell is a rhombus (shaded area) as shown in 
Figure 5.1a . The potential energy y(r) is lattice, periodic: 

Figure 5.1 (a) The WS unit cell, a rhombus (shaded area) for graphene, (b) The orthogonal unit cell, 
a rectangle (dotted lines) with side lengths (b, c). Open circles (o) indicate the C + ions and the lines 
between open circles indicate the chemical bonds. 
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(5.4) V(r + R mtt ) = V(r) , 
where 

(5.5) Rmn = ma-i + na 2 , 

are Bravais vectors with the primitive vectors (a 1? a 2 ) and integers (m, n).^ 1 In the field theoretical 
formulation the field point r is given by 

(5.6) r = r' + R m n , 

where r’ is the point defined within the standard unit cell. Equation 15.41 describes the 2D lattice 
periodicity but does not establish the k-space, which is explained below. 

We first consider an electron in a simple square (sq) lattice. The Schrodinger wave equation is 

02 ) W ^ (r) = -^ v2,p,f » + V! ’ ,r,v(r >- 






























The Bravais vector for the sq lattice R nin (0) is 

(5.8) ttmit = ma* + na Y = mae x + nae Y (a = lattice constant) . 

The system is lattice periodic: 

(5 9) "I” = ■ 

If we choose a set of Cartesian coordinates (x, y) along the sq lattice, then the Laplacian term in 15.71 
is given by 

( g2 \ 

-r-rr + 7-7 1 v) ■ 

If we choose a periodic sq boundary with the side length Na (N: integer), then there are 2D Fourier 
transforms and (2D) k-vectors. 

We now go back to the original graphene system If we choose the x-axis along either or a 2 , then 

the potential energy field y(r) is periodic in the x-direction, but it is aperiodic in the y-direction. For 
an infinite lattice the periodic boundary is the only acceptable boundary condition for the Fourier 
transformation. Then, there is no 2D k-space spanned by 2D k-vectors. If we omit the kinetic energy 
term, then we can still use t5.41 and obtain the ground state energy (except the zero-point energy). 

We now choose the orthogonal unit cell as shown in Figure 5.1b for graphene [6]. The unit cell 
(rectangle) has side lengths: 

(5.11) b = , c = 3a<>, 

where a 0 is the nearest-neighbor distance between two C + ions. The unit cell has four (4) C + atoms. 
The system is lattice periodic in the x- and y-directions, and hence there are 2D k-spaces. 

The importance of using the Cartesian unit cell is also underscored by the stability of the lattice 
structure. If a lattice is in a stable equilibrium, then the excitation energy of an electron (and a 
phonon) must be quadratic in each Cartesian component of the momentum This property is assured if 
the lattice has an inversion symmetry. The honeycomb lattice clearly has a reflection (mirror) 
symmetry about the x-and y-axis in Figure 5.1b . The reflection symmetry can be discussed only in 
terms of the Cartesian frame of coordinates. 

We shall further discuss lattice stability and reflection symmetry in detail in Section 5.3. 

The “electron” (“hole”) in the present text is defined as a quasielectron that has an energy higher 
(lower) than the Fermi energy s F and “electrons” (“holes”) are excited on the positive (negative) side 

of the Fermi surface with the convention that the positive normal vector at the surface points in the 
energy-increasing direction. 

The “electron” (wavepacket) may move up or down along the y-axis to the neighboring hexagon 
sites passing over one C + . The positively charged C + acts as a welcoming (favorable) potential 
valley for the negatively charged “electron” while the same C + act as a hindering potential hill for the 
positively charged “hole.” The “hole,” however, can move horizontally along the x-axis without 
meeting the hindering potential hills. Thus, the easy channel direction for the “electrons” (“holes”) 
are along the y-(x-)axes, see Figure 5.1b . 

Let us consider the system (graphene) at 0 K. If we put an electron in the crystal, then the electron 
should occupy the center O of the Brillouin zone, where the lowest energy lies. Additional electrons 







occupy points neighboring the center O in consideration of Pauli’s exclusion principle. The electron 
distribution is lattice periodic over the entire crystal in accordance with the Bloch theorem [8]. 

Carbon (C) is a quadrivalent metal. The first few low-lying energy bands are completely filled. The 
uppermost partially filled bands are important for discussion of transport properties. We consider 
such a band. The (2D) Fermi surface, which defines the boundary between the filled and unfilled k- 
space (area), is not a circle since the x-y symmetry is broken (b ± c). The effective mass of the 
“electron” is lighter in the y-direction than perpendicular to it. Hence, the motion of the electron is 
intrinsically anisotropic. The negatively charged “electron” stays close to the positive C + ions while 
the “hole” is farther away from the C + ions. Hence, the gain in Coulomb interaction is greater for the 
“electron.” That is, the “electron” is more easily activated. Thus, the “electrons” are the majority 
carriers at zero gate voltage. 

We may represent the activation energy difference by [6] 

(5.12) ci < si. 

The thermally activated (or excited) electron densities are given by 

(5.13) nj{T) = nje- £ ^ kaT K 

where j = 1 and 2 represent the “electron” and “hole,” respectively. The prefactor nj is the density at 
the high-temperature limit. 

5.3 Lattice Stability and Reflection Symmetry 

In 1956 Lee and Yang published a historic paper [9] on parity nonconservation for neutrinos after 
examining the space inversion property of the massless Dirac relativistic wave equation. Inversion 
and reflection symmetry properties are also important in solid state physics. They play important 
roles for the stability of crystal lattices in which electrons and phonons move. On the basis of the 
reflection symmetry properties, we show that the monoclinic (mcl) crystal has a one-dimensional 
(ID) k-space, and the triclinic (tel) crystal has no k-vectors for electrons. For phonons a tel crystal 
has three disjoint sets of ID nonorthogonal k-vectors. The phonons’ motion is highly directional, and 
there are no spherical waves formed. 

There are seven crystal systems, as seen in the book by Ashcroft and Mermin [5]. They are the 
cubic (cub), tetragonal (tet), orthorhombic (ore), monoclinic (mcl), rhombohedral (rhl), hexagonal 
(hex), and triclinic (tel) systems. Arsenic (As) and Bismuth (Bi) form rhl crystals. A rhl crystal can 
be obtained by stretching (or contracting) the three body-diagonal distances from a sc crystal. But the 
body-diagonal directions remain orthogonal to each other after any stretching. Hence, if an orthogonal 
unit cell with the Cartesian axes along the body diagonals containing six corner atoms is chosen, then 
the system is periodic along the x-, y-, and z-axes passing the center. Thus, the system can be regarded 
as an ore crystal, and hence it has a 3D k-space. Diamond (C), silicon (Si), and germanium (Ge) form 
diamond (dia) crystals. A dia lattice can be decomposed into two face-centered cubic (fee) 
sublattices, and can therefore be treated similarly to a cub crystal. A number of elements including 
graphite form hex crystals, hex crystals can be treated similarly to ore crystals by choosing orthogonal 
unit cells. See below for the case of graphite. 

A crystal lattice must be stable. If the lattice is symmetric under the space inversion: 


15.141 r->-r. 

then the electron energy s is quadratic in (k x , k y , k z ) near the origin. The five systems, cub, tet, ore, 

hex, rhl, have inversion symmetry. The mcl has a mirror symmetry only with respect to the x-y plane. 
Therefore, the mcl has a ID k-space. No reflection symmetry is found for a tel crystal. Therefore, the 
tel must be an insulator. 

A mcl crystal has a c-axis. It is reflection-symmetric with respect to the x-y plane perpendicular to 
the c-(z-)axis. It has only ID k-vectors along the c-axis. A tel crystal has no reflection symmetry and 
it has therefore no k-vectors. Hence, it is an intrinsic insulator. 

In summary, cub, tet, ore, rhl, and hex crystals have 3D k-space spanned by 3D k-vectors. mcl 
crystals have ID k-space. tel crystals have no k-vectors. 

This is a significant finding. The same results can also be obtained by using the Schrodinger 
equations as shown in Section 5.2 

Let us now consider small oscillations for a system of atoms forming a sc lattice. Assume a 
longitudinal wave traveling along a cubic axis (say, the x-axis). Imagine hypothetical planes 
perpendicular to the x-axis containing atoms forming a square lattice. This plane has a mass per unit 
square of side length a (the lattice constant), equal to the atomic mass m The plane is subjected to a 
restoring force per cm 2 equal to the Young modulus Y. The dynamics of a set of the parallel planes are 
similar to that of coupled harmonic oscillators. 

Assume next a transverse wave traveling along the x-axis. The hypothetical planes containing many 
atoms are subjected to a restoring stress equal to the rigidity (shear) modulus S. The dynamics is also 
similar to the coupled harmonic oscillators in ID. 

Low-frequency phonons are those to which Debye’s continuum solid model [10] applied. The wave 
equations are 




— cf V 2 u t , 
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where i = 1 (longitudinal) or t (transverse). The longitudinal wave phase velocity Cj is 


q = 




(5.16) 

where p is the mass density. The transverse wave phase velocity c t is 


Ct 

(5.17) 

The waves are superposable. Hence, the movement of phonons is not restricted only to the crystal’s 
cubic directions. In short, there is a 3D k-vector, k. The wave propagation is isotropic. 

Consider now an ore crystal. We may choose a Cartesian coordinate (x, y, z) passing through the 
center of the unit cell. The small oscillations are similar to the case of a sc lattice. The dynamics of 
the parallel plates are the same but the restoring forces are different in x-, y-, z-directions. The plane 
waves have different phase velocities, depending on the directions. They are superposable since 
these waves are still solutions of the wave Equations 15.151 . 

Phonons are quanta corresponding to the running plane wave modes of lattice vibrations. Phonons 
are bosons, and the energies are distributed, following the Planck distribution function: 






(5.18) ' 

There is no activation energy as in the case of the “electrons.” This arises from the boson nature of 
phonons. The temperature T alone determines the average number and the average energy. 

Phonons and conduction electrons are generated based on the same lattice-and k-spaces. This is 
important when describing the electron-phonon interaction. 

The “electrons” and “holes” by postulate have the same orthogonal unit cell size. Phonon size is 
much greater than the electron size. The low-energy phonons have small k and great wavelengths. The 
average energy of a fermionic electron is greater than a bosonic phonon by two or more orders of 
magnitude. This establishes the usual physical picture that a point-like electron runs and is 
occasionally scattered by a cloud-like phonon in the crystal. 

We saw earlier that a mcl crystal has ID k-vectors pointing along the c-axis for the electrons. There 
are similar ID k-vectors for phonons. Besides, there are two other sets of ID k-vectors. Plane waves 
running in the z-direction can be visualized by imagining the parallel plates, each containing a great 
number of atoms executing small longitudinal and transverse oscillations. Thus, plane waves 
proceeding upward or downward exist. 

Consider an oblique net of points (atoms) viewed from the top as shown in Figure 5.2 . Planes 
defined by the vector a and the x-axis are parallel and each plane contains a great number of atoms. 
Planes defined by the vector b and the c-axis are also parallel, and each contains a number of atoms 
also executing small oscillations. These three sets of ID phonons stabilize the lattice. 

Figure 5.2 An oblique net with base vectors (a, b). 



We next consider a tel crystal, which has no k-vectors for electrons. There are, however, three sets 
of ID k-vectors for phonons as shown below. Take a primitive tel unit cell. The opposing faces are 
parallel to each other. There are restoring forces characterized by the Young modulus Y and shear 
modulus S. Then, there are ID k-vectors perpendicular to the faces. The set of ID phonons can 
stabilize the lattice. These phonons in tel are highly directional. There are no spherical waves 
formed. We used the lattice property that the facing planes are parallel. This parallel plane 
configuration is common to all seven crystal systems. A typical hex system, graphene, clearly has 
three sets of parallel material planes containing many atoms. 

The rhl system similarly has parallel planes. The parallel material plane configuration is the basic 
condition for phonon generation and lattice stability. 

In summary, we established based on the reflection symmetry properties of crystals that: 









• The cub, tet, ore, rhl, and hex crystal systems have 3D k-spaces for electrons. The mcl system has 
a ID k-space. The tel crystal system has no k-vectors. 

• The mcl and tel crystal systems have ID phonons, which are highly directional. No spherical 
phonon distributions are generated. 

• The parallel material plane configuration is the basic condition for phonon generation and lattice 
stability. 

• For rhl and hex crystals orthogonal unit cells must be chosen for electron dynamics. 

• “Electrons” and “holes” have the same unit cell size, and they move with different effective 
masses. “Electrons” and “holes” in semiconductors are excited with different activation 
energies. Phonons can be excited with no activation energies. 

• Both phonons and electrons are generated based on the same orthogonal unit cells. This fact is 
important when dealing with the electron-phonon interaction. 

• Both electrons and phonons move as wavepackets. The electron size is the orthogonal unit cell 
size. The average phonon size is greater by two or more orders of magnitude at room 
temperature. 

5.4 Single-Wall Nanotubes 

A tube made of a graphene sheet (a single graphite layer) rolled up into a circular cylinder is called a 
single-wall nanotube. The charge may be transported by the channeling “electrons” and “holes” in the 
graphene wall. But the “holes” present in the inner surface of the swnt also contribute to charge 
transport. The carbon ions in the wall are positively charged. Hence, the positively charged “holes” 
move inside the tube wall. In contrast, the negatively charged “electrons” are attracted by the carbon 
wall and hence cannot travel in a straight line inside the tube wall. Because of this extra channel in 
the inner surface of the carbon wall, “holes” are the majority carriers in nanotubes although 
“electrons” are the dominant carriers in graphene. Moriyama et al. [11] observed electrical transport 
in swnts in the temperature range 2.6-200 K, and determined from a field effect (gate voltage) study 
that the carriers are “holes.” 

The conductivity was found to depend on the helicity or the pitch of the swnt. The helical line is 
defined as the line passing through the centers of the nearest neighbors of the C + -hexagons. The 
helical angle (p is the angle between the helical line and the tube axis.^ The degree of helicity h may 
be defined as 

(5.19) h = cosy . 

For a macroscopically large graphene the conductivity does not show any directional dependence as 
we saw in Section 5.2. The conductivity a in a swnt was experimentally found to depend on the 
helicity h [2]. This is a kind of finite size effect. The circumference, that is, 2n times the tube radius is 
finite while the tube length is macroscopic. 


5.5 Multiwalled Nanotubes 


A graphene sheet can be rolled like wallpaper to produce a multiwalled nanotube (see Figure 1.2 V 



The radius of a mwnt tube can reach 100 A. The “holes” run in shells between the walls. The shell 
area is greatest at the greatest radius. The “hole” current flows should therefore be greatest in the 
outermost shells, which is what is observed. The “hole” current in the shells should also be an 
activated process and the “hole” mass m 3 is different from the graphene “hole” mass m 2 in the 
carbon wall. 

We are now ready to discuss the electrical conductivity of nanotubes. There are four currents 
carried by [6] 

(a) “Electrons” moving in graphene walls with mass n^* and density n x exp(—s^kgT)), running 
through the channels (110) = [110], [Oil], and [101]. 

(b) “Holes” moving inside graphene walls with mass m 2 * and density n 2 exp(-s 2 /k B T)), running 
through the channels (100) = [100], [010], and [001]. 

(c) “Holes” moving in shells between the walls with the mass m 3 * and the densities n 3 
exp(-e 3 /(k B T)) running in the tube-axis direction. The activation (or excitation) energy s 3 and the 
effective mass m 3 may vary with the radius of the shell and the pitch. 


(d) Cooper pairs (pairons), which are formed by the phonon-exchange attraction. 

In actuality, one of the currents may be dominant, and be observed. 

In the normal Ohmic conduction the resistance R is proportional to the sample (tube) length. Then, 
the conductivity a can be defined, and this a is given by the Drude formula: 
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where q is the carrier charge (±e), m the effective mass, n the carrier density, and x the relaxation 
(collision) time. The relaxation rate T = x _1 is the inverse of the relaxation time. If the inpurities and 
phonons are scatterers, then the rate T is given by the sum of the inpurity scattering rate T imp and the 

phonon scattering rate r ph (T) (Matthiessen’s rule [5]): 


(5.21) ^ = ^ m p T ^pfo(-0 ■ 

The impurity scattering rate T imp is temperature-independent, and the phonon scattering rate T ph (T) is 
temperature (T)-dependent. The phonon rate T ph (T) is linear in T above 4 K: 

(5.22) JpbCO = »T , a = constant. 


The temperature dependence of the conductivity should arise from the carrier density n(T) and the 
phonon scattering rate r ph (T). Writing the T-dependence explicitly, we obtain ffom f5.14W5.22f 
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(5.23) 

Ebbesen et al. [2] measured the electrical conductivity of individual nanotubes of different radii 
and pitches. The data are reproduced in Figure 5.3 . Sample data were detailed in Table 1 in [2]. 
Resistance R is seen to decrease with increasing temperature T for NT8 and NT2. They may be 
interpreted in terms of the three normal currents (a)-(c) mentioned above. NT4 and NT5 were 
analyzed using a semilog-arithmic plot of R versus T -1 as shown in [2, Figure 3], which is 
reproduced in Figure 5.4 . 









Figure 5.3 Temperature dependence of the resistance of different NTs after Ebbesen et al. [2]. (a) 
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Figure 5.4 Semilogarithmic plot of the resistance R ofNT4 and NT5 versus the inverse of the 
temperature after Ebbesen et at. [2]. 












The slopes yield activation energies s of 0.1 and 0.3 eV for NT4 and NT5, respectively. The 
currents in NT5 should be due to the “electrons” running through the channels (110) in the graphene 
wall or the “holes” running in the channels {100). The Hall coefficient or the Seebeck coefficient 
measurements will answer this question. The lack of linearity in NT4 suggests that the relevant 
currents are due to the “holes” moving in shells along the tube axis. The activation energy s 3 depends 

on the tube radius and circumference. Since the currents along the tube axis are from the shells 
between walls of different circumferences, there is a distribution of s 3 , which destroys strict linearity. 

NT7a in Figure 5.3 shows a resistance increase starting above around 220 K. A possible cause for 
this is a “neck” Fermi surface where the electron effective mass becomes abnormally high, making the 
conductivity very small. If this is the case, the resistance should rise and fall as the temperature is 
increased further. This needs to be checked experimentally. 

NT6 in Figure 5.3 shows a very small T-independent resistance at room temperature. A room- 
temperature superconductivity, if it exists, is a quite remarkable phenomenon. The superconducting 
state has a critical temperature T c above which the normal currents flow. The data indicates that the 

critical temperature T c is higher than 350 K. It is desirable to perform experiments to see if the 
resistance is recovered at higher temperatures. 

5.6 Summary and Discussion 

The conductivity of carbon nanotubes is varied, depending on the temperature T, the tube radius, and 
pitch. In the majority of cases, the conductivity a decreases with increasing T, often called the 
semiconductor-like T-behavior. The current band theory based on the WS cell model predicts a 
gapless semiconductor, and cannot explain the observed T-behavior. A new band model suitable for 





the Bloch electron dynamics is presented, in which the “electron” (“hole”) has not only the charge - 
(+)e but also the rectangular size. 

Applying our model to graphene and nanotubes, we have obtained the following results: 

• In graphene “electrons” (1) are easier to excite than “holes” (2) with the activation energies s, < 
e 2 . The majority carriers are “electrons” without external fields. 

• The “electron” motion is intrinsically anisotropic so that the effective mass m 2 is much smaller 
in {110) than perpendicular to it. 

• If the “electron” density is varied by the application of a gate voltage, then a “neck” Fermi 
surface develops. 

• The “electron” (“hole”) moves easily in the direction (110) ({100)) with the effective mass n^* 
(m 2 *). The channeling “electrons” and “holes,” however, generate isotropic currents. 

• In swnts “electrons” and “holes” can run in the carbon wall in the same manner as in graphene. 

• The tube length is macroscopic. But the circumference is finite, and it is measured in nanometers. 
This generates a helicity (pitch) dependence for the conductivity. 

• The “holes” on the inner side of the carbon wall can contribute to the conduction. Because of this 
extra channel the majority carriers in swnts are “holes.” 

• In mwnt “holes” run in shells between carbon walls. The “hole” currents depend on the size of 
the shells. The currents are greatest for the outermost shells. 

• Only “holes” can be excited in the shells with the activation energies s 3 . The s 3 depend on the 

circumference. A distribution of s 3 destroys the strict Arrhenius law. 

The conductivity in mwnts varies widely. The behavior in most cases can be interpreted in terms of 
“electrons” and “holes” and using the Drude formula (5.201 . Some nanotube samples show a 
nonlinear current-voltage behavior; a superconductivity is suspected, which will be treated in 
Chapter 8. 
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1) Referring to Figure 5.1a the primitive lattice vectors can be written as a l = (1/2, A /3/2)a 0 , a 2 = 

(-1/2, V5/2)a 0 in the x-y coordinates, where a 0 = = |a 2 | = 2.46 A is the lattice constant of 

graphene [7]. 

2) The term “chirality” is used for the “helicity” in some literature. The chiral angle (p is then 
defined as the angle between the tube axis and the helical (chiral) line. 



Chapter 6 


Semiconducting SWNTs 


The conduction of a single-wall carbon nanotube depends on the pitch. If there are an integral number 
of carbon hexagons per pitch, then the system is periodic along the tube axis and allows “holes” (not 
“electrons”) to move inside the tube. This case accounts for the semiconducting behavior with an 
activation energy of the order of 3 meV There is a distribution of the activation energy since the pitch 
and the circumference can vary. In other cases, SWNT show metallic behaviors. “Electrons” and 
“holes” can move in the graphene wall (two dimensions). The conduction in the wall is the same as in 
graphene if the finiteness of the circumference is disregarded. Cooper pairs formed by phonon 
exchange attraction moving in the wall are shown to generate a temperature-independent conduction 
at low temperature (3-20 K). 


6.1 Introduction 

Iijima [1] found after his electron diffraction analysis that the carbon nanotubes range from 4 to 30 nm 
in diameter, and consisted of helical multiwalled tubes. A single-wall nanotube (SWNT) is about 1 
nm in diameter and microns (pm) in length. Ebbesen et al. [2] measured the electrical conductivity a 
of carbon nanotubes and found that a varies depending on the temperature T, the tube radius r, and the 
pitch p. Experiments show that SWNTs can be either semiconducting or metallic, depending on how 
they are rolled up from the graphene sheets [3]. In this chapter we present a microscopic theory of the 
electrical conductivity of semiconducting SWNTs, starting with a graphene honeycomb lattice, 
developing Bloch electron dynamics based on a rectangular cell model [3], and using kinetic theory. 

A SWNT can be formed by rolling a graphene sheet into a circular cylinder. The graphene which 
forms a honeycomb lattice is intrinsically anisotropic as we shall explain in more detail in Section 
6.2. Moriyama et al. [4] fabricated 12 SWNT devices from one chip, and observed that two of the 
SWNT samples were semiconducting and the other ten metallic, the difference in the room- 
temperature resistance being of two to three orders of magnitude. The semiconducting SWNT samples 
show an activated-state temperature behavior. That is, the resistance decreases with increasing 
temperature. Why do these two sets of samples show very different behavior? The answer to this 
question arises as follows. The line passing the centers of the nearest-neighbor carbon hexagons 
forms a helical line around the nanotube with a pitch p and a radius r. In Figure 6.1a . a section of the 
circular tube with pitch p is drawn. Its unrolled plane is shown in Figure 6.1b . The circumference 2nr 
likely contains an integral number m of carbon hexagons (units). The pitch p, however, may or may 
not contain an integral number n of units. The pitch is not controlled in the fabrication process. In the 
first alternative, the nanotube is periodic with the minimum period p along the tube axis. Then, there 
is a one-dimensional (ID) k-vector along the tube. A “hole” which has a positive charge +e and a 
size of a unit ring of height p and radius r can move inside the positively charged carbon wall. An 




“electron” having a negative charge —e and a similar size is attracted by the carbon wall of positive 
C + ions, and hence it cannot travel in a straight line inside the wall. Thus, there should be an extra 
“hole” channel current in a SWNT. Moriyama et al. [4] observed a “hole”-like current after 
examining the gate voltage effect. The system should have the lowest energy if the unit ring contains 
an integer set (m, n) of carbon hexagons, which may be attained after annealing at high temperatures. 
This should occur if the tube length is comparable with the circumference. The experimental tube 
length is much greater (thousand times) than the circumference 2n r, and the pitch angle can be varied 
continuously. As has already been stated in the fabrication process the pitch is not controlled. The set 
of irrational numbers is greater in cardinality than the set of rational numbers. Then, the first case in 
which the unit contains an integer set (m, n) of hexagons must be in the minority. This case then 
generates a semiconducting transport behavior. We shall show later that the transport requires an 
activation energy. Fujita and Suzuki [7] showed that the “electrons” and “holes” must be activated 
based on the rectangular cell model for graphene. 


Figure 6.1 (a) A section of circular tube wall with a radius r and pitch p. (b) Its unrolled plane. 



Saito, Dresselhaus, and Dresselhaus [3] state that a SWNT is characterized by two integer indices 
(m, n), for example, m = n for an armchair nanotube whereas m = 0 for a zigzag nanotube. If n - m is a 
multiple of 3, then the SWNT is metallic. They then argued that approximately one third of SWNTs 
are metallic, and the other two thirds are semiconducting. This model is in variance with the 
experimental observation by Moriyama et al. [4], where the majority of SWNTs are metallic. We must 
look for a different classification scheme. 

If a SWNT contains an irrational number of carbon hexagons, which happens more often than not, 
then the system does not allow conduction along the tube axis. The system is still conductive since the 
conduction electrons (“electrons,” “holes”) can move inside the tube wall. This conduction is two- 
dimensional (2D), as opposed to ID, as can be seen in the unrolled configuration, which is precisely 
the graphene honeycomb lattice. This means that the conduction in the carbon wall should be the same 
as the conduction in graphene if the effect of the finiteness of the tube radius is neglected. 

We consider graphene in Section 6.2. The current band theory of the honeycomb crystal is based on 
the Wigner-Seitz (WS) cell model [3, 5]. The model applied to graphene predicts a gapless 
semiconductor, which is not observed. The WS model [5] is suitable for the study of the ground state 
energy of the crystal. To describe the Bloch electron dynamics [6] one must use a new theory based 
on a Cartesian unit cell which does not match the natural triangular crystal axes. Also phonons can be 
discussed naturally by using the Cartesian coordinate systems, and not with the triangular coordinate 
systems. The conduction electron moves as a wavepacket formed by Bloch waves as pointed out by 












Ashcroft and Mermin in their book [6]. This picture is fully incorporated into our theoretical model 

[7]. 


6.2 Single-Wall Nanotubes 

Let us consider the long SWNT rolled with a graphene sheet. The charge may be transported by the 
channeling “electrons” and “holes” in the graphene wall. But the “holes” within the wall surface also 
contribute to the transport of charge. Because of this extra channel inside the carbon nanotube, 
“holes” are the majority carriers in nanotubes although “electrons” are the dominant carriers in 
graphene. Moriyama et al. [4] observed electrical transport in a semiconducting SWNT in the 
temperature range 2.6-200 K, and found from the field (gate voltage) effect study that the carriers are 
“hole”-like. Their data are reproduced in Figure 6.2 . 

Figure 6.2 Log-scale plot of the currents in a semiconducting SWNT as a function of inverse 
temperature after Moriyama et al. [4], 

10K 5K 2.5K 



The conductivity depends on the pitch of the SWNT. The helical line is defined as the line passing 
through the centers of the nearest neighbors of the C + -hexagons. The helical angle (p is the angle 
between the helical line and the tube axis. The degree ofhelicity h may be defined as 

(6.1) h = cos <p . 

The conductivity a in (semiconducting) SWNTs depends on this helicity h. This is a kind of finite size 
effect. The circumference is finite while the tube length is macroscopic. 

In a four-valence electron system such as graphene all electrons are bound to ions and there is no 
conduction at 0 K. If a “hole” having the charge + e and the size of a unit cell is excited, then this 
“hole” can move along the tube axis with the activation energy e 3 and the effective mass m 3 . Both s 3 

and m 3 depend on the radius and the pitch. 

We are now ready to discuss the conductivity of a SWNT. There are four currents carried by 






(a) “Electrons” moving in the graphene wall with mass and density 


Biexp O) ■ 

running through the channels (110) (= [110], [Oil], and [101]). 

(b) “Holes” moving inside the graphene wall with the mass m 2 and the density 

Bieip (0) ' 

running through the channels (100) (= [100], [010], and [001]). 

(c) “Holes” moving with the mass m 3 and the density 


running in the tube-axis direction. The activation (or excitation) energy s 3 and the effective 
mass m 3 , vary with the radius and the pitch. 


For a macroscopically large graphene the conductivity does not show any directional 
dependence. The same easy channels in which the “electron” runs with a small mass, may be 
assumed for other hexagonal directions, [Oil] and [101], 

The currents run in the three channels {110) = [110], [011], and [101]. The total current 
(magnitude) along the field direction p is proportional to [10] 
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Hence, the total current does not depend on the angle # between the field direction p and the 
cha nne l current direction k (Problem 6.2.1). 

(d) Cooper pairs (pairons) formed by the phonon-exchange attraction, which move freely in the 
graphene wall. 

In actuality, one of the currents may be dominant, and be observed. In the normal Ohmic conduction 
because of the conduction electrons the resistance is proportional to the sample (tube) length. Then, 
the conductivity a is given by the Drude formula: 
nq 2 _ nq 2 1 
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where q is the carrier charge (±e), m* the effective mass, n the carrier density, and t the relaxation 
(collision) time. The relaxation rate T = t _ 1 is the inverse of the relaxation time. If the inpurities and 
phonons are scatterers, then the rate T is the sum of the impurity scattering rate T jmp and the phonon 

scattering rate r ph (T): 


(6.4) ^ ~ ^ m P ^ph(P) ■ 

The inpurity scattering rate T jmp is temperature-independent and the phonon scattering rate r ph (T) is 
temperature (T)-dependent. The phonon scattering rate r ph (T) is linear in T above around 2 K: 


(5 5) .Tph(T) — aT , a — constant, (above 2 K) . 

The temperature dependence should arise from the carrier density n(T) and the phonon scattering rate 
T ph (T). Writing the T-dependence explicitly, we obtain from Equations 16.31 and 16.51 
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where the carrier density n is replaced by the thermally activated (or excited) electron densities nj(T): 

ren(T) = nt exp (—— ] , i = 1 “electron” and j = 2 “hole" . 

(6.7) V W) 


The prefactor nj is the density at high temperature. 

Moriyama et al. [4] used the Arrhenius plot for the data above 20 K and obtained the activation 
energy 


t6.81 H ~ 3 meV. 

By studying the field (gate voltage) effect, the carriers were found to be hole-like. Thus, the major 
currents observed can be interpreted in terms of the “holes” moving within the tube wall. 

This “hole” axial transport depends on the unit ring containing m x n hexagons. Since the pitch and 
the circumference have distributions, the activation energy s 3 should also have a distribution. Hence, 

the obtained value in f6.8^ must be regarded as the averaged value. 

Liu et al. [11] systematically measured the resistance p(T) of SWNTs under hydrostatic pressures, 
and fitted their data by using a 2D variable range hopping (vrh) theoretical formula [12]: 
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Here p 0 is a (resistance) parameter and 


(6.10) T 0 = 525 K 

is a (temperature) fit parameter. Mott’s vrh theory [12] is applicable when highly random disorders 
exist in the system An individual SWNT (annealed) is unlikely to have such randomness. We take a 
different view here. The scattering is due to normally assumed inpurities and phonons. But carriers 
(“holes”) have a distribution in the unit cell size. Hence, the distribution of the activation energy 
introduces the flattening of the Arrhenius slope by the factor 1/3. 

We now go back to the data shown in Figure 6.2 . Below 20 K the currents observed are very small 
and they appear to approach a constant in the low-temperature limit (large T _l limit). These currents, 
we believe, are due to the Cooper pairs. 

The Cooper pairs (pairons) move in 2D with the linear dispersion relation [13]: 
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where v F ® 


is the Fermi velocity of the “electron” (j = 1) and of the “hole” (j = 2). 


Consider first “electron” pairs. The velocity v is given by (omitting superscript) 

dp p x 

v = —— or v x = t —~~~ = c— , 
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The equation of motion along the E-field (x-)direction is 
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where q' is the charge ±2e of a pairon. The solution of I6.15I is given by 
f6.16t P* + - 
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where p x ° is the initial momentum component. The current density j p is calculated from q' (charge) 
np (number density) x ^ (average velocity). The average velocity j? is calculated by using f6.13I and 
f6.16t with the assumption that the pair is accelerated only for the collision time t. We obtain 


j P = g'«p v 


1 C 

— q'n ? c— (q'Er) = q' 1 —n v Br 
(6.17) P P P P 


For stationary currents, the pairon density rip is given by the Bose distribution function f(s p ) 
(6.18) n? ~ f {C?) ~ eKp(£ p /(fe B T)-a)- 1 r 


where e a is the fiigacity. Integrating the current j p over all 2D p-space, and using Ohm’s law j = oE 
we obtain for the conductivity a (Problem 6.2.2): 

a = — / d 2 p p ~ 1 f(£n)r . 

16.191 {Ixhf J pp J{pi 

In the temperature ranging between 2 and 20 K we may assume the Boltzmann distribution function 
for f(Sp): 

(6.20) ^ (Ep) ~ ) 

We assume that the relaxation time arises from the phonon scattering so that t = (aT) -1 , see 
f6.3W6.5T After performing the p-integration and setting q' = - 2e for the pairon charge, we obtain 
(Problem 6.2.3) 

l e 2 fcs a 

f6,21f x, ah 2 

We note that this a is temperature-independent. If there are “electron” and “hole” pairs, they 
contribute additively to the conductivity. These pairons should undergo a Bose-Einstein condensation 
at a temperature lower than 2.2 K. We predict a superconducting state at lower temperatures. 

Problem 6.2.1. Show that cos 2 # + cos 2 (# + 2n/3) + cos 2 ^ - 2n/3) = 3/2. 

Problem 6.2.2. Derive (6.19) . 

Problem 6.2.3. Derive Equation f6.21T 


6.3 Summary and Discussion 

A SWNT is likely to have an integral number of carbon hexagons around the circumference. If each 
pitch contains an integral number of hexagons, then the system is periodic along the tube axis, and 
“holes” (not “electrons”) can move along the tube axis. The system is semiconducting with an 
activation energy s 3 . This energy e 3 has a distribution since both pitch and circumference have 
















distributions. The pitch angle is not controlled in the fabrication process. There are numerous other 
cases where the pitch contains an irrational numbers of hexagons. In these cases the system shows a 
metallic behavior experimentally [14]. 

In the process of arriving at our main conclusion we have uncovered the following results: 

• “Electrons” and “holes” can move in 2D in the carbon wall in the same manner as in graphene. 

• The above implies that the conduction in the wall shows no pitch dependence for a long SWNT. 

• The Cooper pairs are formed in the wall. They should undergo BEC at low temperatures, 
exhibiting a superconducting state. 

A metallic SWNT will be treated in Chapter 8. 
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Chapter 7 


Superconductivity 


We describe the basic properties of superconductors, occurrence of superconductors, theoretical 
background, and quantum statistical theory of superconductivity in this chapter. 


7.1 Basic Properties of a Superconductor 

Superconductivity is characterized by the following six basic properties: zero resistance, Meissner 
effect, magnetic flux quantization, Josephson effects, gaps in elementary excitation energy spectra, and 
sharp phase change. We shall briefly describe these properties in this section. 


7.1.1 Zero Resistance 

The phenomenon of superconductivity was discovered in 1911 by Kamerlingh Onnes [1], who 
measured extremely small electric resistance in mercury below a certain critical temperature T c (~ 

4.2 K). His data are reproduced in Figure 7.1 . This zero resistance property can be confirmed by the 
never-decaying supercurrent ring experiment described in Section 7.1.3. 


Figure 7.1 Resistance (in ohm) versus temperature (in kelvin), after Kamerlingh Onnes [1]. 



7.1.2 Meissner Effect 

Substances that become superconducting at finite temperatures will be called superconductors. If a 
superconductor below T c is placed under a weak magnetic field, it repels the magnetic field B 

completely from its interior as shown in Figure 7.2a . This is called the Meissner effect, which was 
discovered by Meissner and Ochsenfeld [2] in 1933. 

















Figure 7.2 The Meissner effect, (a) A superconductor repels a weak magnetic field from its body 
below the transition temperature (T <T C ) while (b) at a temperature above the transition temperature 
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The Meissner effect can be demonstrated dramatically by a floating magnet as shown in Figure 7.3 . 
A small bar magnet above T c simply rests on a superconductor dish. If the temperature is lowered 

below T c , then the magnet will float as indicated. The gravitational force exerted on the magnet is 

balanced by the magnetic pressure (part of the electromagnetic stress tensor) due to the 
nonhomogeneous magnetic field (B-field) surrounding the magnet, which is represented by the 
magnetic flux lines. 

Figure 7.3 A floating magnet. 



Later more refined experiments reveal that a small magnetic field penetrates into a very thin surface 
layer of the superconductor. Consider the boundary of a semi-infinite slab. When an external field is 
applied parallel to the boundary, the B-field falls off exponentially: 

(7.1) £(*) = 0)e-*' A , 

as indicated in Figure 7.4 . Here X is called a penetration depth, which is of the order of 500 A in 
most superconductors at lowest temperatures. Its small value on a macroscopic scale allows us to 
describe the superconductor as being perfectly diamagnetic. The penetration depth X plays a very 
important role in the description of the magnetic properties. 


(T > T c ) the magnetic field penetrates its body. 



fb) 


Figure 7.4 Penetration of the magnetic field into a superconductor slab. The penetration depth X is of 
the order of 500 A near 0 K. 
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7.1.3 Ring Supercurrent and Flux Quantization 

Let us take a ring-shaped cylindrical superconductor. If a weak magnetic field B is applied along the 
ring axis and the temperature is lowered below T c , then the field is expelled from the ring due to the 

Meissner effect. If the field is slowly reduced to zero, part of the magnetic flux lines can be trapped 
as shown in Figure 7.5 . The magnetic moment generated is found to be maintained by a never- 
decaying supercurrent flowing around the ring [3]. 

Figure 7.5 A set of magnetic flux lines are trapped in the ring. 



More delicate experiments [4, 5] show that the magnetic flux enclosed by the ring is quantized as 

(j2)® = n& 0 , » = 0 , 1,2 . 

d> 0 = Jl = — = 2.07x 10 -7 Gcm 2 . 

(7.3) 2e e 

® 0 is called a (Cooper pair) flux quantum. The experimental data obtained by Deaver and Fairbank 
[4] are shown in Figure 7.6 . The superconductor exhibits a quantum state represented by the quantum 
number n. 

Figure 7.6 The magnetic flux quantization, after Deaver and Fairbank [4]. The two sets of data are 
shown as A and o- 

































7.1.4 Josephson Effects 

Let us take two superconductors (S 1? S 2 ) separated by an oxide layer of thickness on the order of 10 

A, called a Josephson junction. This system as part of a circuit including a battery is shown in Figure 
7.7 . Above T c , the two superconductors, Sj and S 2 , and the junction I all show potential drops. If the 

temperature is lowered beyond T c , the potential drops in S, and S 2 disappear because of zero 

resistance. The potential drop across junction I also disappears! In other words, the supercurrent runs 
through junction I with no energy loss. Josephson predicted [6], and later experiments [7] confirmed, 
this Josephson tunneling, also called a DC Josephson effect. 

Figure 7.7 Two superconductors S, and S 2 , and a Josephson junction I are connected to a battery. 



Let us take a closed loop superconductor containing two similar Josephson junctions and make a 
circuit as shown in Figure 7.8 . Below T c , the supercurrent I branches out into I, and I 2 . We now apply 

a magnetic field B perpendicular to the loop. The magnetic flux can go through the junctions, and the 
field can be changed continuously. The total current I is found to have an oscillatory component: 


Figure 7.8 Superconducting quantum interference device (SQUID). 
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(7.4 ) 1 



= constant), 


where ® is the magnetic flux enclosed by the loop, indicating that the two supercurrents lj and I 2 , 

macroscopically separated (~ 1 mm), interfere just as two laser beams coming from the same source. 
This is called a Josephson interference. A sketch of the interference pattern [8] is shown in Figure 


L9. 


Figure 7.9 Current versus magnetic field, after Jaklevic et al. [8]. 
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The circuit in Figure 7.8 can be used to detect an extremely weak magnetic field. This device is 
called the Superconducting Quantum Interference Device (SQUID). 

In the thermodynamic equilibrium, there can be no currents, super or normal. Thus, we must deal 
with a nonequilibrium condition when discussing the basic properties of superconductors such as 
zero resistance, flux quantization, and Josephson effects. All of these arise from the supercurrents that 
dominate the transport and magnetic phenomena. When a superconductor is used to form a circuit with 
a battery, and a steady state is established, all current passing the superconductor are supercurrents. 
Normal currents due to the moving electrons and other charged particles do not show up because no 
voltage difference can be developed in a homogeneous superconductor. 

7.1.5 Energy Cap 

If a continuous band of the excitation energy is separated by a finite gap s g from the discrete ground 

state energy level as shown in Figure 7.10 . then this gap can be detected by photo-absorption [9, 10], 
quantum tunneling [11], heat capacity [12], and other experiments. This energy gap s g is found to be 

temperature-dependent. The energy gap e g (T) as determined from the tunneling experiments [13] is 
shown in Figure 7.11 . The energy gap is zero at T c , and reaches a maximum value s g (0) as the 
temperature approaches 0 K. 

Figure 7.10 Excitation energy spectrum with a gap. 
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Figure 7.11 The energy gap s g (T) versus temperature, as determined by tunneling experiments, after 
Giaver and Megerle [13]. 















7.1.6 Sharp Phase Change 

The superconducting transition is a sharp phase change. In Figure 7.12 . the data of the electronic heat 
capacity C el plotted as C el /T against T, as reported by Loram et al. [14] for YBA 2 Cu0 6+x (2D 

superconductor) with the x-values, are shown. The data at x = 0.92 have the highest T c . There are no 
latent heat and no discontinuity in C el at T c . Below T c there is a complex long-range order, which may 

be treated by the Ginzburg-Landau theory [15]. For a 3D superconductor such as lead (Pb), there is a 
jump in the heat capacity and the phase change is of the second order (no latent heat). 

Figure 7.12 Electronic heat capacity C el /T over temperature T, that is y(x, T) = C el /T is plotted 
against T for YBa 2 Cu 3 0 6+x relative to YBa 2 Cu 3 0 6 . Values of x are 0.16, 0.29, 0.38, 0.43, 0.48, 0.57, 
0.67, 0.76, 0.80, 0.87, 0.92, and 0.97, after Loram et. al. [14]. 
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7.2 Occurrence of a Superconductor 

The occurrence of superconductors is discussed in this section. 





















7.2.1 Elemental Superconductors 

More than 40 elements become superconducting at the lowest temperatures. Table 7.1 shows the 
critical temperature T c and the critical magnetic fields B c at 0 K. Most nonmagnetic metals tend to be 

superconductors, with notable exceptions being monovalent metals such as Li, Na, K, Cu, Ag, and Au. 
Some metals can become superconductors under applied pressures and/or in thin films, and these are 
indicated by asterisks in Table 7.1 . 


Table 7.1 Superconductivity parameters of the elements. 
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An asterisk f*) denotes superconductivity in thin films or under high pressure. 
Transition temperature lyinkeLvin and critical magnetic field B t at 0 K in gauss. 


7.2.2 Compound Superconductors 

Thousands of metallic compounds are found to be superconductors. A selection of compound 
superconductors with critical temperature T c are shown in Table 7.2 . Note: the critical temperature 

T c tends to be higher in compounds than in elements. Nb 3 Ge has the highest T c (~ 23 K). 

Table 7.2 Critical temperatures of selected compounds. 
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Compound superconductors exhibit type II magnetic behavior different from that of type I elemental 




















































superconductors. A very weak magnetic field is expelled from the body (the Meissner effect) just as 
by the type I superconductor. If the field is raised beyond the lower critical field H cl , the body allows 

a partial penetration of the field, still remaining in the superconducting state. A further field increase 
turns the body to a normal state upon passing the upper critical field H c2 . Between H cl and H c2 , the 

superconductor is in a mixed state in which magnetic flux lines surrounded by supercurrents, called 
vertices, penetrate the body. The critical fields vs. temperature are shown in Figure 7.13 . The upper 
critical field H c2 can be very high (20T = 2.0 x 10 5 G for Nb 3 Sn). Also the critical temperature T c 
tends to be high for high- H c2 superconductors. These properties make compound superconductors 
useful for devices and magnets. 


Figure 7.13 Phase diagram of type I (a) and type II (b) superconductors. 



7.2.3 High-T c Superconductors 

In 1986 Bendnorz and Muller [16] reported the discovery of the first cuprate superconductors, also 
called high-temperature superconductors (HTSC). Since then, many investigations have been carried 
out on the high-T c superconductors including YBACuO with T c ~ 94 K [17]. The boiling point of 

abundantly available and inexpensive liquid nitrogen (N) is 77 K. So the potential application of 
HTSC’s, which are of type II, appears to be great. The superconducting state of these conductors is 
essentially the same as that of elemental superconductors. 


7.3 Theoretical Survey 

We review briefly the current theories of superconductivity. 

7.3.1 The Cause of Superconductivity 

At present superconductivity in solids is believed to be caused by a phonon-exchange attraction. 
When a phonon is exchanged between two electrons, a bound electron pair, called a Cooper pair 
(pairon), is formed [18]. 

The exchange of a boson (phonon) between the two fermions (electrons) can be pictured as the 
emission of a boson by a fermion and the subsequent absorption of the boson by another fermion. To 
describe the emission and absorption of the boson requires a new theory, called a second- 










quantization formulation in which creation and annihilation operators are introduced. The second- 
quantization formulation is summarized in Appendix A. 1. The electron-phonon interaction and the 
phonon-exchange attraction are discussed in Chapter 4 (Sections 4.3 and 4.4). 


7.3.2 The Bardeen-Cooper-Schrieffer Theory 

In 1957 Bardeen, Cooper, and Schrieffer (BCS) published a classic paper [19] that is regarded as 
one of the most important theoretical works in the twentieth century. The Nobel Prize for physics in 
1972 was shared by Bardeen, Cooper, and Schrieffer for this work. We shall briefly review the BCS 
theory. 

In spite of the Coulomb repulsion among electrons there exists a sharp Fermi surface for the normal 
state of a conductor, as described by the Fermi liquid model of Landau [20, 21] (see Chapter 3, 
Section 3.4). The phonon-exchange attraction can bind pairs of electrons near the Fermi surface 
within a distance (energy) equal to Planck constant h times the Debye frequency co D . The bound 

electron pairs, each having antiparallel spins and charge (magnitude) 2e, are called Cooper pairs 
(Pairons). “Cooper pair” and “pairon” both denote the same entity. When we emphasize the 
quasiparticle aspect rather than the two electron composition aspect, we use the term “pairon” more 
often in this text. 

BCS started with a Hamiltonian Mbcs i n the form: 


Hbcs = H Ek c l* ck * + H 


Et >o 


k s 


17.51 *1 II *4 14 

where s ki = S| is the kinetic energy of a free electron measured relative to the Fermi energy s F and 
c k 1 si^( c k 1 s 1 ) = c i^( c i) are creation (annihilation) operators satisfying the Fermi anticommutation rules: 

j c Jtf l -y J r j c ks c ]fr^ + c \'s> , 

(7 5 ) {cki.cx’A — = 0 ' 

The first (second) sum on the rhs of (7.5) represents the total kinetic energy of “electrons” with 
positive s k (“holes” with negative e k ). The matrix element {l,2|y|3,4) denotes the net interaction arising 

from the virtual exchange of a phonon and the Coulomb repulsion between electrons. Specifically, 


{1,2|V|3,4> = 


0 


VqV iffrrnaxl < 

otherwise , 


(7.7) 

where y 0 is a constant (energy) and V is a sample volume. 


Starting with the Hamiltonian (7.5) . BCS obtained an expression W for the ground state energy: 
(7.8) W = A<wd27(0)wo = Nou>o , 
where 


Wt) = 



(!* = VoV - ’) , 


(7.9) 









is the pairon ground state energy, 


(7.10) N 0 = Afi> D P(0) 

the total number of pairons, and p(0) the density of states (DOS) per spin at the Fermi energy. In the 
variational calculation of the ground state energy, BCS found that the unpaired electrons, often called 
the quasi electrons^- 1 , not joining the ground pairons that form the supercondensate, have the energy 
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where s k is the kinetic energy of a free “electron” relative to the Fermi energy. 


The energy constant A, called the quasielectron energy gap, in (7.111 is greatest at 0 K and 
decreases to zero as the temperature is raised to the critical temperature T c . BCS further showed that 

the energy gap at 0 K, A(T = 0) = A 0 and the critical temperature T c are related (in the weak coupling 
limit) by 

(7.121 2zl 0 = 3.53fc B T c . 

These findings of (7.81-(7.121 are among the most important results obtained in the BCS theory. A 
large body of theoretical and experimental work followed several years after the BCS theory. By 
1964 the general consensus was that the BCS theory is an essentially correct theory of 
superconductivity. 

BCS assumed the Hamiltonian (7.51 containing “electron” and “hole” kinetic energies. They also 
assumed a spherical Fermi surface. However, these two assumptions contradict each other. If a Fermi 
sphere whose inside (outside) is filled with electrons is assumed, then there are “electrons” (“holes”) 
only (see Chapter 3, Section 3.5). Besides this logical inconsistency, if a free electron model having a 
spherical Fermi surface is assumed, then the question of why metals such as sodium (Na) and 
potassium (K) remain normal to the lowest temperatures cannot be answered. We must incorporate 
the band structures of electrons more explicitly. We shall discuss a generalization of the BCS 
Hamiltonian in Sections 7.4 and 7.7. 


7.3.3 Quantum Statistical Theory 

In a quantum statistical theory one starts with a reasonable Hamiltonian and derives everything from 
this, following step-by-step calculations. Only Heisenberg’s equation of motion (quantum mechanics), 
Pauli’s exclusion principle (quantum statistics), and Boltzmann’s statistical principle (grand 
canonical ensemble theory) are assumed. 

The major superconducting properties were enumerated in Section 7.1. The purpose of a 
microscopic theory is to explain all of these from first principles, starting with a reasonable 
Hamiltonian. Besides, one must answer basic questions such as: 

• What causes superconductivity? The answer is the phonon-exchange attraction. We have already 
discussed this interaction in Chapter 4. It generates Cooper pairs [18], called pairons for short, 
under certain conditions. 

• Why do inpurities that must exist in any superconductor not hinder the supercurrent? Why is the 
supercurrent stable against an applied voltage? Why does increasing the magnetic field destroy 
the superconducting state? 









• Why does the supercurrent dominate the normal current in the steady state? 

• What is the supercondensate whose motion generates the supercurrent? How does magnetic-flux 
quantization arise? Josephson interference indicates that two supercurrents can interfere 
macroscopicallyjust as two lasers from the same source. Where does this property come from? 

• Below the critical temperature T c , there is a profound change in the behavior of the electrons as 

shown by Bardeen, Cooper, and Schrieffer in their classical work [19]. What is the cause of the 
energy gap? Why does the energy gap depend on the temperature? Can the gap A(T) be observed 
directly? 

• Phonons can be exchanged between any electrons at all times and at all temperatures. The 
phonon-exchange attraction can bind a pair of quasielectrons to form moving (or excited) 
pairons. What is the energy of excited pairons? How do the moving pairons affect the low- 
temperature behavior of the superconductor? 

• All superconductors behave alike below T c . Why does the law of corresponding states work 

here? Why is the supercurrent temperature- and material-independent? 

• What is the nature of the superconducting transition? Does the transition depend on 
dimensionality? 

• About half of all elemental metals are superconductors. Why does Na remain normal down to 0 
K? What is the criterion for superconductivity? What is the connection between 
superconductivity and band structures? 

• Compound, organic, and high-T superconductors in general show type II magnetic behaviors. 

Why do they behave differently compared with type I elemental superconductors? 

• All superconductors exhibit six basic properties: (1) zero resistance, (2) Meissner effects, (3) 
flux quantization, (4) Josephson effects, (5) gaps in the elementary excitation energy spectra, and 
(6) sharp phase change. Can a quantum statistical theory explain all types of superconductors in a 
unified manner? 

• Below 2.2 K, liquid He 4 exhibits a superfluid phase in which the superfluid can flow without a 
viscous resistance, the flow property remarkably similar to the supercurrent. Why and how does 
this similarity arise? 

We shall discuss a generalization of the BCS Hamiltonian incorporating the band structures of 
electrons more explicitly in the following sections. 

7.4 Quantum Statistical Theory of 
Superconductivity 

Fujita and his group developed a quantum statistical theory of superconductivity in a series of papers 
[22-26], We present this theory in the following sections. 

We construct a generalized BCS Hamiltonian which contains the kinetic energies of “electrons” and 
“holes,” and the pairing Hamiltonian arising from phonon-exchange attraction and Coulomb 
repulsion. We follow the original BCS theory to construct a many-pairon ground state and find a 
ground state energy. 



7.4.1 The Generalized BCS Hamiltonian 


BCS assumed a Hamiltonian containing “electron” and “hole” kinetic energies and a pairing 
interaction, see (7.5) . They also assumed a spherical Fermi surface. But if we assume a free electron 
model, we cannot explain why only some, and not all, metals are superconductors. We must 
incorporate the band structures of electrons explicitly. In this section we set up and discuss a 
generalized BCS Hamiltonian developed by Fujita and his group [22-26]. 

We assume that 

• In spite of the Coulomb interaction, there exists a sharp Fermi surface at 0 K for the normal state 
of a conductor (the Fermi liquid model, see Chapter 3). 

• The phonon exchange attraction can bind Cooper pairs near the Fermi surface within a distance 
(energy) equal to the Planck’s Constance h times the Debye frequency co D . 

• “Electrons” and “holes” have different effective masses (magnitude). Thus ^ s k (2 \ 

• The pairing interaction strengths among and between “electron” (1) and “hole” (2) pairons are 
different so that 


— Vtj V 1 3+*4 ^*1,53 d Jj,S 4 
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otherwise. 
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We note that “electrons” and “holes” are different quasiparticles, which will be denoted distinctly. 

In the ground state there are no currents for any system To describe a ring supercurrent that can run 
indefinitely at 0 K, we must introduce moving pairons, that is, pairons with finite center of mass (CM) 
momenta. Creation (annihilation) operators for “electron” (1) and “hole” (2) pairons are respectively 
defined by^ 

R mt _ r u)t _ p( 2 )t _ r p)t J 2 )t 

(7.141 ~ — C l C 2 ’ "34 ~ C 4 C 1 ■ 

The pairon operators are denoted by B’s, which should not be contused with the magnetic field B (= 
|B|). Odd-numbered “electrons” carry up-spin f and even-numbered “electrons” carry down-spin j. 
By using the Fermi anticommutation rules for c’s and c^’s, the commutators among B’s and B’ ’s are 
given by (Problem 7.4.1) 
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if Jtj = Jtj and k 2 = Ic* 
if fci = Jtj and k 2 ^ k*. 
if ki ^4 fcj and k 2 — Ic* 
otherwise . 


Pairon operators of different types j always commute: 

(7.17)[ b “" bW ] = ° 

Here 
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_ JJ> _ ,U)1 r U) 

n 2 — n ki,s - c * 2 4 c *i4 


represent the number operators for “electrons” (j = 1) and “holes” (j = 2). 
Let us now introduce the relative and net (or CM) momenta (k, q) such that 








fe+ f 


k = |(ti - k 2 ) ) ( ki = k + | 

(7.19) 9 — ki + ^2 j t fc 2 = -1 
The pairon annihilation operators are then alternatively represented by 

r'P) _ R (1J _ Jl) Jl> r/(2) _ J2) (2) 

(7.20) “ -H-a/li JH-S/2t ' H ~ C k+qj2f C -k+ql2± 1 

The prime on B’s will be dropped hereafter. In the k-q representation the commutation relations 
17.151 and 17.161 are re-expressed as (Problem 7.4.2) 

NMy] = °- KT = °. 


r dU! j'lti _ 

17.211 UV^WJ - 

r (l - Kk+qj 2 t - n-n+^tfdjt 

6, 


if k — if and q = q f 
if fe + | = )^ + i 
and — t + | ^ — \f + i 

if i + + i 

and — fc + | = -fc / + i 
otherwise . 

Using the new notation for pairons, we now write the generalized BCS Hamiltonian w (Problem 
7.4.3): 

ffl 


17.221 


r tn jm 

C -k+qtll C -k'+fiil 


C U) C im dn 
C k+ql2i C k'+^/2t°J i 

0 


« - -E'E'E' M’X 

k,s k,s k k' q 


nPJt n(2)t , „ r < 2) nP> , „ rW R (2)f\ 

( 222 } + B *« + ^ ^ + ^ 2 J ’ 

where (= V -1 ^) is the pairing interaction strengths^ per volume among and between “electron” (1) 
and “hole” (2) pairons. n ks ^^ and n ks ^ are the number operators for “electrons” and “holes” 


respectively, defined by 17.181 . It should be noted that the q-summation is taken over all momenta 
including the zero momentum The Hamiltonian 17.231 . called the generalized BCS Hamiltonian, 
expresses the complete Hamiltonian for the system, which can describe moving pairons as well as 
stationary pairons. Here, the primes on the summation symbols indicate the restriction arising from 
the phonon-exchange attraction. 

Problem 7.4.1. Verify 17.151 and 17.161 . 

Problem 7.4.2. Verify 17.211 and 17.221 . 

Problem 7.4.3. Derive 17.231 . 


7.5 The Cooper Pair Problem 

In 1956 Cooper demonstrated [13] that, however weak the attraction may be, two electrons just above 
the Fermi sea can be bound. The binding energy is greater if the two electrons have opposite momenta 
(p, - p) and antiparallel spins (f, j). The lowest bound energy w 0 is found to be 

W/i = - 

17.241 ®tp(2/(i*2?(0))) - 1 1 


















where co D is the Debye frequency, v 0 a positive constant characterizing the attraction, and p(0) the 

electron density of states per spin at the Fermi energy. If electrons having energy nearly opposite 
momenta (p, -p + q) are paired, the binding energy is less than |w 0 |. For small q, which represents the 

net momentum (magnitude) of a pairon, the energy-momentum relation, also called the dispersion 
relation, is given in the form: 

17,251 w q = m+cq <0, 

where c/v F =1/2 (2 /tt) for 3D (2D) and v F = (2e F /m*) 1/2 is the Fermi speed. Equations 17.241 and 

(7.25! play very important roles in the theory of superconductivity. We shall derive these equations in 
this and the next sections. 

Two electrons near the Fermi surface can gain attraction by exchanging a phonon. This attraction 
can generate a bound electron pair. We shall look for the ground state energy of the Cooper pair 
(pairon). We anticipate that the energy is lowest for the pairon with zero net momentum. Moving 
pairons will be considered in the following section. 

We consider a 2D system. This will simplify the concept and calculations. The 3D case can be 
treated similarly. Let us take two electrons just above the Fermi surface (circle), one electron having 
momentum k and up-spin f and the other having momentum -k and down-spin j, see Figure 7.14 . We 
measure the energy relative to the Fermi energy e F : 


Figure 7.14 A stationary Cooper pair having zero net momentum. 



The sum of the kinetic energies of the two electrons is 2e k . By exchanging a phonon, the pair’s 
momenta change from (k, -k) to (k% -k’). This process lowers the energy of the pair. 

The energy-eigenvalue equation for the 2D case is obtained from [13]: 

(7.7.7 

where w 0 is the pairon ground state energy, and A(k) the waveftmction; the prime on the integral sign 
means the restriction: 


(7.28) 0 < e* < haiu ■ 

Equation 17.271 can be solved simply as follows. Consider the integral: 


C^^_ 
(7.291 (2**) 2 


J d 2 k'A(k'), 


which is a constant. Assume that the energy w 0 is negative: 


(7.30) w 0 <0. 
















Then, 2s k - w 0 = 2s k + |w 0 | > 0. After rearranging the terms in (7.27) and dividing the result by 2e k + 
|w 0 |, we obtain 

A(t) - l ——-C . 

(7.31) 2e* + K| 

Substituting this expression into 17.29) and dropping the common factor C, we obtain 

_]_ 

(7.32) [Ink) 1 } 2c t +K|' 

By introducing the density of states at the Fermi energy, u(0), we can evaluate the k-integral as 
follows: 

1 = ^<°> / d£ 2TTw = 111 (iST + 0 • 

Solving this equation, we obtain (Problem 7.5.1) 

— 2 fotOjy 

CU3) = «P(2/(m»(0|)) -1 • 

We thus find a negative energy for the stationary pairon. The v 0 -dependence of the energy w 0 is 
noteworthy. Since exp(2/x) cannot be expanded in powers of x = v 0 £>(0), the energy w 0 cannot be 
obtained by a perturbation (v 0 -expansion) method. We note that formula (7.33) also holds for the 3D 

case with a 3D density of states.^ 

Problem 7.5.1. Verify (733). 


7.6 Moving Pairons 


r j d 2 k'a(k', q), 


The phonon-exchange attraction is in action for any pair of electrons near the Fermi surface. In 
general the bound pair has a net momentum, and hence, it moves. Such a pair is called a moving 
pairon. The energy w q of a moving pairon for the 2D case can be obtained from a generalization of 

(7.27) : 

«y»(M) = [ £ (| k + f |) + e (|-* + \ 

(7.34) (2**) 

which is Cooper’s equation, Eq. (1) of his 1956 Physical Review [18]. The prime on the k’-integral 
means the resrtiction on the integration domain arising from the phonon-exchange attraction, see 
below. We note that the net momentum q is a constant of motion, which arises from the fact that the 
phonon exchange is an internal process, and hence cannot change the net momentum. The pair 
wavefunctions a(k, q) are coupled with respect to the other variable k, meaning that the exact (or 
energy-eigenstate) pairon wavefunctions are superpositions of the pair wavefunctions a(k, q). 

The Cooper eigenvalue equation (7.34) can be derived, starting with a many-body Hamiltonian. A 
derivation of Cooper’s equation (7.34) is given in Appendix A.3. We note that (7.34) is reduced to 

(7.27) in the small q-limit. The latter equation was solved in the previous section. 

Let us solve (7.34) for w q by using the same technique to obtain the pairon ground state energy (that 






















is, the energy of a stationary pairon). We assume that the energy w q is negative: w q < 0. Then, e(|k + 
q/2|) + e(| - k + q/2|) — w q > 0. Rearranging the terms in t7.34) and dividing by s(k + q/2|) + s(|k + 
q/2|) - w q , we obtain 

£735) o(l ’ q) = V (1 1V + II) + e (l“ k '+ II) “ “’•] ’ C <«> ' 
where 


VO 


(7.36) qg) V**) 2 


J d 2 k f a{k / ,q) 


which is k-independent. 

Introducing (7.35) in 17.34) . and dropping the common factor C(q), we obtain 


1 = 


Vo 


f dlk [ e (I 1+ 11) +c i\~ k + II) + M 


(7.37) 

We now assume a 2D tree electron model. The Fermi surface is a circle of the radius (momentum): 

(7.38) fe F = |k F | = (2mi£ F )^ 2 , 

where m 1 represents the effective mass of an “electron.” The energy s(|k|) is given by 


*(|k|) = 


|k | 2 - |k F | J 


(7.39) ” " 2mi 

The prime on the k-integral means the restriction: 


(7.40) ° € £ (l fc + 11) • c (|- k + ll) < ‘ <UD - 

We may choose the z-axis along q as shown in Figure 7.15 . We assume a small q and keep terms up 
to the first order in q. The k-integral can then be expressed by (Problem 7.6.1) 


Figure 7.15 The range of the interaction variables (k, 0) is restricted to the circular shell of thickness 
k D . 
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fcdt 


w q \ + (fc 3 - fc|) / mi 

J d6\n \ w ^ + 2 ^ ( ° d ~ Vf ^ cos& 


0 Jtp+^cQse 

njl 

= 2 mi 


(7.4 n S |u>,| + i>f4cos0 

(7.42) to = . 

where we retained the linear term in k D /k F only since k D « k F . After performing the 0-integration, we 

obtain (Problem 7.6.2) 

2 

t7.43^ Wq = W ° + n ^ t2D>> 

where w 0 is given by (7.33) . A similar result for a 3D case, 


(7 44) = *<> + ( 3D ) 

was first obtained by Cooper (but unpublished). It is recorded in Schrieffer’s book [27], 12.15k It 
should be noted that the lowest bound energies w 0 in t7.44I for 2D and in 17.441 for 3D are of the 

same form: 


_ -2fea) D 
W ° “ eK P (2/(^P(0))) - 1 ‘ 

It depends, however, on the specific form of the density of states xftO) at the Fermi momenta for 2D 
and 3D, respectively. As expected, we can see that the zero-momentum pairon has the lowest energy. 
The excitation energy is continuous with no energy gap. The energy w q increases linearly with 

momentum q for small q, rather than quadratically. This arises since the pairon density of states is 
strongly reduced with increasing momentum q, and this behavior dominates the q 2 -increase of the 
kinetic energy. Pairons move like massless particles with a common speed 21 n v F for two dimensions 

and v f /2 for three dimensions. The linear dispersion relation^ plays a vital role in the Bose-Einstein 
condensation (BEC) of pairons (see Section 7.9). 

Problem 7.6.1. Verify (7.41k Hint: Use the diagram in Figure 7.15 . 

Problem 7.6.2. Derive an energy-momentum relation 17.431 for 2D. 


7.7 The BCS Ground State 

7.7.1 The Reduced Generalized BCS Hamiltonian 

At 0 K there are only iground state pairons, that is, pairons having the lowest energies. The ground 
state I'P) for the system may then be constructed based on the reduced Hamiltonian ^ red , which can be 

represented in terms of pairon operators, b’s, only^ (Problem 7.7.1): 


















=e mw+e - e'e' w 

n.4tt +-»W t + *i»W + ^« r ) • 

where v^ = V -1 Vij and b® are pair annihilation operators. The primes (’) on the summation symbols 
indicate the restriction^ arising from the phonon-exchange attraction. The reduced Hamiltonian ^ red is 

bilinear in pairon operators (b, b^), and can be diagonalized exactly. Note that corresponding to 
t7.211 and (7.22) . the commutation relations for pair operators, b’s, are given by (Problem 7.7.2) 

[f-< J >,*«] = b u) h n _ fcfflfcUJ = o, [*u>] a = fcUifcU) = o , 

,7,46, [•’»'■ *pf] = (i - -S? - . j. * -1.2 ■ 

For the sake of argument, let us drop the interaction Hamiltonian altogether in (7.45) . We then have 
the first two sums representing the kinetic energies of “electrons” and “holes.” We note that these 
energies (sj/ 1 ), e k ( ®) are positive by definition. Then the lowest energy of this system, called the 


Block system, is zero, and the corresponding ground state is characterized by zero-momentum 
“electrons” and “holes.” This state will be called the physical vacuum state. In the theoretical 
developments in this section, we look for the ground state of the generalized BCS system whose 
energy is negative. 

We now examine the physical meaning of the interaction strengths v^ (= V^yjj). Noting that the 

exchange of a phonon can pair-create to pair-annihilate “electron” (“hole”) pairons, let us examine 
parts of the interaction terms in (7.45) : 

(7 47 ^ ■ 

The first term generates a transition of the electron pair from (kf, -kj) to (k’f, - k’j). This transition 
is presented by the k-space diagram in Figure 7.14 . Such a transition may be generated by the 
emission of a virtual phonon with momentum q = k’ — k(-q) by the down(up)-spin “electron” and 
subsequent absorption by the up(down)-spin “electron” as shown in Figure 7.16a.b . These two 
processes are distinct, but yield the same net transition. As we saw earlier the phonon exchange 
generates an attractive change of states between two “electrons” whose energies are nearly the same. 
The Coulomb interaction generates a repulsive correlation. The effect of this interaction is included 
in the strength v n . Similarly, the exchange of a phonon induces a change of states between two 


“holes,” and it is represented by the second term in (7.47) . The exchange of a phonon can also pair- 
create or pair-annihilate “electron” (“hole”) pairons, called -(+) pairons, and the effects of these 
processes are represented by 


Figure 7.16 (a,b) Two Feynman diagrams representing a phonon exchange between two electrons. 
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These two processes are indicated by k-space diagrams in Figure 7.17 . The same processes can be 
represented by Feynman diagrams in Figure 7.18 . where the time flows upwards by convention. 
Accordingly, “electrons” (“holes”) proceed in the positive (negative) time directions. A phonon is 
electrically neutral; hence the total charge before and after the phonon exchange must be the same. 
The interaction Hamiltonians in (7.47) and (7.48) all conserve charge. 

Figure 7.17 k-space diagrams representing (a) pair creation of ground pairons and (b) pair 
annihilation. 



Figure 7.18 Feynmann diagrams representing (a) pair-creation of ±ground pairons from the physical 
vacuum, and (b) pair annihilation. 



For type I elemental superconductors, the interaction strengths are all equal to each other; 

(7.49) v n = V 22 = v u = v 2 \ = i>o (type I). 

In high- T c superconductors, the interaction strengths Vjj are not equal because the Coulomb repulsion 
is not negligible and inequalities 

(7.50) v n = V 22 < i*2i2 = i*2i (high-r c ) 

hold. For further discussion of the generalized BCS Hamiltonian, see original papers [22-26]. 


7.7.2 The Ground State 















We now look for the ground state of the generalized BCS system. At 0 K there are only ± ground 
pairons. The ground state for the system may then be constructed based on the reduced 
Hamiltonian ?i red in 17.451 . Following BCS [12], we assume that the normalized ground state ket pF) 

can be written as 


m=Tt- 


i+fip n '. 




|0) ■ 


05T) 

Here the ket |0) by definition satisfies 
(7.52) 4i’l 0 ) = = 0. 


It represents the physical vacuum state for “electrons” (1) and “holes” (2), that is, the ket |0) = |<|> 

represents the ground state of the Block system with no “electrons” and no “holes” present. In 17.511 
the product variables k (and k’) extend over the region of the momenta whose associate energies are 
bounded: 0 < s k (2) < hco D , and this limitation is indicated by the primes on the product symbols. 

The k-space shell in which pairons are generated and intercorrelated are shown in Figure 7.19 . 


Figure 7.19 The k-space shell in 2D where pairons of both charge types are generated and 
intercorrelated. 



Since [b k ®^] 2 = 0 [see (7.461 ]. only two terms appear for each k (or k’) in the ground state ket 
(7.511 . The quantity |g k ®^| 2 represents the probability that the pair states (kf, —kj) are occupied. By 
expanding the product, we can see that the BCS ground state IT 1 ) contains the zero-pairon state |0), one- 
pairon states b®t|0), two-pairon states b^b®^|0), ... The ket IT*) is normalized such that 

(7.53)(y|y) = l- 

In the case where there is only one state k in the product, we obtain 


f 1F|1F> = f0|- 


l + sW 


i+ei 




( i+ k r) 


1}2 


|0} = 1 


The general case can be worked out similarly (Problem 7.7.3). 

Since the ground state wave function has no nodes, we may choose g k ® to be nonnegative with no 
loss of rigor: g k ® > 0. We now determine {g k ®} such that the ground state energy 
(7.54)V = (P|fli|P> 

























has a minimum value. This maybe formulated by the extremum condition: 
q^ &W =d{W\n Tsd \W} = 0. 

The extremum problem meant by (7.55) with respect to the variation in g’s can more effectively be 
solved by working with variations in the real probability amplitudes u’s and v’s defined by 


The normalized ket IT 1 ) can then be expressed by 

m=rT (.? + n' (»?+w • 

(7.57) * * 

The energy W can be written from (7.54) as (Problem 7.7.4) 

*=+e ' 2 w 2 - e'e' e e w • 

(7.58) * *' t k r i j 

Taking the variations in v’s and u’s, and noting that u k ® 8u k ® + v k (j) 8v k ® = 0, we obtain from (7.55) 
and 7.581 (Problem 7.7.5) 

W - K - O E' MW+ "XX )=o • 

C7.5<fl »' 

To simply treat these equations subject to the equations in (7.561 . we introduce a set of energy 
parameters: 

A U) / UP +A U)2\ yi 

(7.601 * ' * + * ) 

such that (Problem 7.7.6) 


i+O 


- 1/2 


, uW + V W = 1 


(J)2 (J)2 

u k ~ v k ~ 


£ 
s 0) ' 


uU'v™ = 


A V 


2£, 


U) 


(7.611 —* 

Then, Equation (7.59) can be re-expressed as 

/ 2 

4'=EE^dn- 

(7.621 *' 1=1 2E k f 

Since the rhs of (7.62 does not depend on k, the energy gaps 
(7.63) 4 J> = A j 

are independent of k. Hence, we can simplify (7.621 to 

A 


a j =Y!ll Vi j: 


:P) - 


(7.641 *' i 2E k' 

These are called gemeralized energy gap equations. As we shall see later, E k (l) is the energy of an 

unpaired electron (or “electron”). These “electrons” have energy gaps A(j) relative to the Fermi 
energy as shown in Figure 7.20 . Notice that there are in general two types of energy gaps: “electron” 
G = 1) and “hole” (j = 2) energy gaps, (A b A 2 ). 


Figure 7.20 Quasielectrons have an energy gap A relative to the Fermi energy. 
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Using (7.58) along with (7.61) - (7.64) . we calculate the energy W and obtain (Problem 7.7.7) 

*=E'E 2 'i ,i f - E'E'E E "M?"" 1 -"' 








065) 

In the bulk limit the sums over k are converted into energy integrals, yielding 

2 ft £Ud 

= X>j(°) / 
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de 


066) 
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2 £ 2 + Zl 




The ground state IT 1 ), from (7.51) . is a superposition of many-pairon states. Each component state 
can be obtained from the physical vacuum state |0) by pair-creation and/or pair-annihilation of 
±pairons and pair states change through a succession of phonon exchanges. Since the phonon- 
exchange processes, as represented by (7.48) . can pair-create (or pair-annihilate) ±pairons 
simultaneously from the physical vacuum, the supercondensate is composed of equal numbers of 
±pairons. We can see from Figure 7.18 that the maximum numbers of +(-)pairons are given by 
l^hcoDU^O) (l/2ho) D i) 2 (6)). We must then have 

067) Di(0) = P2(0) = x>{0), 

which will be justified by the assumption that the supercondensate is generated only on part of the 
Fermi surface (see Section 7.8.8). 

Using (7.67) . we obtain from (7.66) (Problem 7.7.8) 

-. 1 / 2 ' 


W - -Notw'iF w*) - 


U’< = fiu) B J 1 


■*te) 1 


< 0 


068) 

We thus find that the ground state energy of the generalized BCS system is negative, that is, the energy 
is lower than that of the Bloch system Further note that the binding energy |wj per pairon may in 

general be different for different charge types. 

Let us now find Aj from the gap Equations (7.64) . In the bulk limit, these equations are simplified to 

ftflJD ft^D 


= Ej , 2 >< 0 ) J 


de- 


di 


(e 2 + d[) 


1/2 


+ 


Ej2D(0) f 


d£- 


(e 2 + A\) 


2\V 2 


= ^vjil>(0)disinh 1 + ^ J2 I?(0)dismh 1 ■ 
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For type I elemental superconductors, we assume that the interaction strengths v^ are all equal to 



























each other: v n = v 12 = v 21 = v 22 = v o- We see from 17.691 that “electron” and “hole” energy gaps 
coincide: 

(7.70) A\ = A 2 = A . 

The generalized gap Equations 17.641 are then reduced to a single equation: 

A 




2 d'l ’ 

which is called the BCS energy gap equation. After dropping the common factor A and taking the bulk 
limit, we obtain (Problem 7.7.9) 


1 = voD(O) sinh 


-i 


~) 


(7.72) 

Solving this we obtain (Problem 7.7.10) 

fc<a> D 


A = 


(= N 0 u> 0 ) 


(7.73) sinh(l/(voX>(0))) 

We now substitute 17.731 into 17.681 and calculate the ground state energy. After straightforward 
calculations, we obtain (Problem 7.7.11) 

y- - 2p W* v » (= n 0 . 0) . 

(7.74-1 «pW(w>D(0)))-l 1 01 

Equations 17.731 and 17.741 are the famous BCS formulas for the energy gap and the ground state 
energy, respectively. They correspond respectively to 12.401 and 12.421 of the original paper [19]. We 
stress that these results are exact, obtained from the reduced BCS Hamiltonian ^ red in 17.451 without 

using weak coupling limit (v 0 —► 0). 

Problem 7.7.1. Verify 17.451 . 

Problem 7.7.2. Verify the commutation relations 17.461 for pair operators. 

Problem 7.7.3. Verify 17.531 for the general ket I'Pj. Hint: Assume that there are only two k-states in 
the product. If successful, then treat the general case. 

Problem 7.7.4. Derive 17.581 . 

Problem 7.7.5. Derive 17.591 . 

Problem 7.7.6. Check the consistency of 17.561 and 17.611 . Hint: Use the identity: (u 2 + v 2 ) 2 - (u 2 - 
v 2 ) 2 = 4uV. 

Problem 7.7.7. Verify 17.651 . 

Problem 7.7.8. Verify 17.681 . 

Problem 7.7.9. Derive 17.721 . 

Problem 7.7.10. Verify 17.731 
Problem 7.7.11. Derive 17.741 . 


7.8 Remarks 


We have uncovered several significant features of the ground state of the generalized BCS system 






























7.8.1 The Nature of the Reduced Hamiltonian 

The reduced Hamiltonian ^ red in (7.451 has a different character from the normal starting Hamiltonian 

for a metal, which is composed of interacting electrons and ions. BCS envisioned that there are only 
zero-momentum pairons at 0 K. Only the basic ingredients to build up zero-momentum pairons are 
incorporated in the BCS Hamiltonian, see (7.5) . Bloch “electrons” and “holes” are introduced from 
the outset. These particles are the elementary excitations in the normal state above the critical 
temperature. 


7.8.2 Binding Energy per Pairon 


We may rewrite (7.741 for the ground state energy in the form: 

■*» j. 

W - NqWq , No - fi<o D Vm , Wo - - — -, 

(7.75) e*p(2/(v<,D(0))) - 1 

which can be interpreted as follows: the greatest total number of pairons generated consistent with the 
BCS Hamiltonian is equal to hco D u(0) = N 0 . Each pairon contributes a binding energy |w 0 |. This 

energy |w 0 | can be measured directly by quantum tunneling experiments. Our interpretation of the 

ground state energy is quite natural, but it is distinct from that of the BCS theory, where the energy gap 
A is regarded as a measure of the binding energy. Our calculations do not support this view, see 
Section 7.8.4. 

By the Meissner effect a superconductor expels a weak magnetic field B from its interior. The 
magnetic energy stored is higher in proportion to B 2 and the excluded volume than that for the uniform 
B-flux configuration. The difference in the energy for a macroscopic superconductor is given by 


V B 2 

(7.76) ' 

If this energy exceeds the difference of the energy between super and normal conductors, W s - W N , 
which is equal to |W 0 |, the superconducting state should break down. The minimum magnetic field B c 
that destroys the superconducting state is the critical field at 0 K, B c (0) = B 0 . We therefore obtain 
(Problem 7.8.1) 


(171) |W S - Wkl = | W,| = NoKI = i VBjVF 1 . 

which gives a rigorous relation between the binding energy |w 0 | and the critical field B 0 . 


7.8.3 The Energy Cap 


In the process of obtaining the ground state energy W by the variational calculation, we derived the 
energy-gap equation (7.641 . which contains the energy parameters (see (7.60) 1 



The fact that E k ® represents the energy of a quasielectron, can be seen as follows [14]: the 


quasiparticle energy is defined to be the total excitation energy of the system when an extra particle is 
added to the system From (7.58) we see that by negating the pair state (kf, — kj), the energy is 











increased by (Problem 7.8.2) 


-2 4 V > 2 + 2 




JV 1 ) 

* k * 


079) =-2 B »ff» 1 + 2^ 1 i^F« 


where we used (7.61) and (7.62V To this energy we must add the energy of e k ( 1 * of the added 
“electron.” Thus, the total excitation energy As is (Problem 7.8.3) 


Thus, the unpaired electron (or “electron”) has the energy E k W as shown in Figure 7.20 . Note that the 
validity domain for the above statement is 0 < s k (l ^ < ho) D . 


7.8.4 The Energy Cap Equation 

The reduced Hamiltonian ^ red was expressed in terms of zero-momentum pairon operators, b’s, only 

as in (7.45T The ground state ket (T 1 ) in (7.5 If contains b’s only. Yet in the energy-gap equations, 
which follow from the extremum condition for the ground state energy, the energies of the 
quasielectron, E k ®, appear unexpectedly. Generally speaking the physics is lost in the variational 

calculation. We shall derive the gap equation from a different angle by using the equation-of-motion 
method (see Appendix A.2). 

Let us rederive the gap Equations (7.64) by using the equation-of-motion method. 

The supercondensate is made up of ground pairons, which can be described in terms of the pairon 
operators, b’s. We calculate [^ red , b k ( 1 and [^ red , b k ^ 2) ] to obtain (Problem 7.8.4) 

[*,«., I-™] = £,1.™ 


=- f (i - -a - -a*) ■ 

(7.8r> V *' *' / 



where ?/ red is the reduced Hamiltonian (7.45V Equations (7.8 If and (7.82) indicate that the dynamics 
of ground pairons depends on the presence of “electrons” (quasielectrons) describable in terms of 
n®. We now multiply (7.81f from the right by cjqpQ, where cjq is the electron pairon energy-state 
annihilation operator, and take a grand ensemble trace. From the first term on the rhs, we obtain 

(7 83) 2.»m {»{*#,*} = = 2 £ <XV'f. - 

where 

(7.84) Fi = Wi|V|- 

From the first and second sums, we obtain (Problem 7.8.5) 


















(7.85) *' 

Since we are looking for the ground state energy, the eigenvalues E l and E 2 are zero: 


(7.86) £i = E 2 = 0. 

Collecting all contributions, we obtain 


24MV - K 3 - tf") E' b-K 1 + 

*' 

where we used (Problem 7.8.6) 

(ii (i) 

(7.881 = "*■ 

Since Fj = ( X P|()) 1 |'P) ^ 0, we obtain from (787) 


(787)L 


Fi = 0, 


24W- 

(7.89) 

which is just one of the equations in (7.59k the equations equivalent to the energy gap equations 
(764). 

The ground state of IT) of the BCS system is a superposition of many-pairon states and therefore 
quantities like (F|b k ' J ^|T), (T|b k ®^b k ^|T), ..., that connect states of different particle numbers do not 
vanish. In this sense the state |T) can be defined in a grand ensemble. 

We emphasize that the supercondensate state is made up of equal numbers of ±pairons, and this state 
is reachable from the physical vacuum by a succession of phonon exchanges. Since pair creation and 
pair annihilation of pairons actually lower the system energy [see (7.481 ] and since pairons are 
bosons, all pairons available in the system are condensed into the zero-momentum state; the maximum 
number of pairons is hro D p(0). The number of condensed pairons at any one instant may fluctuate 

around the equilibrium value; such fluctuations are in fact more favorable. 


K 13 -vT)E( 






7.8.5 Neutral Supercondensate 

The supercondensate composed of equal numbers of ±pairons is electrically neutral. This neutrality 
explains the stability of the superconducting state against a weak electric field because no Lorentz 
electric force can be exerted on the supercondensate. The stability is analogous to that of a stationary 
excited atomic state, say, the 2p-state of a neutral hydrogen atom 
A neutral supercondensate is supported by experiments. If a superconducting wire S is used as part 
of a circuit connected to a battery, as shown in Figure 7.21 . then the wire S, having no resistance, 
generates no potential drop. If a low-frequency AC voltage is applied to it, its response becomes 
more complicated. But the behavior can be accounted for if we assume that it has a normal component 
with a finite resistance and a super part. This is the two fluid model [28, 29]. Super part or 
supercondensate, decreases with rising temperature and vanishes at T c . The normal part may be 

composed of any charged elementary excitations including quasielectrons and excited pairons. At any 
rate, analysis of all experiments indicate that the supercondensate is not accelerated by the electric 
force. This must be so. Otherwise the supercondensate would gain energy without limit since the 
supercurrent is slowed down by neither inpurities nor phonons, and a stationary state would never 













have been observed in the circuit. 

Figure 7.21 A circuit containing a superconductor (S), battery, and resistance. 



7.8.6 Cooper Pairs (Pairons) 

The concept of pairons is inherent in the BCS theory, which is most clearly seen in the reduced 
Hamiltonian ^ red , expressed in terms of pairon operators, b’s, only. The direct evidence for the fact 

that a Cooper pair is a bound quasiparticle having charge (magnitude) 2e comes from flux 
quantization experiments, see Figures 7.5 and 7.6 . 

7.8.7 Formation of a Supercondensate and Occurrence of 
Superconductors 

We discuss the formation of a supercondensate based on the band structures of electrons and phonons. 
Let us first take Pb, which forms a face-centered cubic (fee) lattice and which is a superconductor. 
This metal is known to have a neck-like hyperboloidal Fermi surface represented by 

(n om E = + =4- + p- - <»1> = (11®, 0-244. -8L71}m , 

(7.90) 2mi 2m 2 2mj 

where mis the electron mass. See, for example, [30], Section 13.4. 

We postulate that the supercondensate composed of ±ground pairons is generated near the “necks.” 
The electron transitions are subject to Pauli’s exclusion principle, and hence creating pairons 
requires a high degree of symmetry in the Fermi surface. A typical way of generating pairons of both 
charge types by one phonon exchange near the neck is shown in Figure 7.18 . Only parts of “electrons” 
and “holes” near the specific part of the Fermi surface are involved in the formation of the 
supercondensate. The numbers of ±pairons, which are mutually equal by construction, may both then 
be represented by hco D ^(0)/2, which justifies (7.67) . Next take aluminum (Al), which is also a known 

fee superconductor. Its Fermi surface contains inverted double caps. Acoustic phonons with small 
momenta may generate a supercondensate near the inverted double caps. Supercondensation occurs 
independently of the lattice structure as long as “electrons” and “holes” are present in the system 
Beryllium (Be) forms a hexagonal closed packed (hep) crystal. Its Fermi surface in the second zone 
has necks. Thus, Be is a superconductor. Tungsten (W) is a body-centered cubic (bcc) metal, and its 
Fermi surface has necks. This metal also is a superconductor. In summary, type I elemental 
superconductors should have hyperboloidal Fermi surfaces favorable for the creation of ±pairons 
mediated by small-momentum phonons. All of the elemental superconductors whose Fermi surface is 
known appear to satisfy this condition. 

To test further let us consider a few more examples. A monovalent metal, such as Na, has a 
spherical Fermi surface within the first Brillouin zone. Such a metal cannot become superconducting 
at any temperature since it does not have “holes” to begin with; it cannot have +pairons and, 












therefore, cannot form a neutral supercondensate. A monovalent fee metal like Cu has a set of necks at 
the Brillouin boundary. This neck is forced by the inversion symmetry of the lattice, see Figure 3.7 . 
Chapter 3. The region of the hyperboloidal Fermi surface may be more severely restricted than those 
necks (unforced) in Pb. Thus, this metal may become superconducting at extremely low temperatures, 
which is not ruled out. 


7.8.8 Blurred Fermi Surface 


A normal metal has a sharp Fermi surface at 0 K. This fact manifests itself in the T-linear heat 
capacity generally observed at the lowest temperatures. The T-linear law is in fact the most important 
support for the Fermi liquid model. For a superconductor the Fermi surface is not sharp everywhere. 
To see this, let us solve 17.611 with respect to u k 2 and v k 2 . We obtain 


(7.91) 
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•*1 = r 1 4- 
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Figure 7.22 shows a general behavior of v k 2 near the Fermi energy. For the normal state A = 0, there 
is a sharp boundary at s k = 0; but for a finite A, the quantity v k 2 drops off to zero over a region of the 
order (2 ~ 3)A. This v k 2 represents the probability that the virtual electron pair at (kf, -kj) 

participates in the formation of the supercondensate. It is not the probability that either electron of the 
pair occupies the state k. Still, the diagram indicates the nature of the changed electron distribution in 
the ground state. The supercondensate is generated only near the necks and/or inverted double caps. 
Hence, these parts of the Fermi surface are blurred or fuzzy. 


Figure 7.22 The behavior v k 2 near the Fermi surface. 


Problem 7.8.1. 
Problem 7.8.2. 
Problem 7.8.3. 
Problem 7.8.4. 
Problem 7.8.5. 
Problem 7.8.6. 



Verify (7.79k 

Verify 17.80k 

Derive 17.811 and 17.82k 

Verify 17.85k 

Verify CL88). 



















7.9 Bose-Einstein Condensation in 2D 


BCS [19] introduced electron-pair operators: 

( 7 . 92 ) “ c t+ c — k — “ c k c —k ' kfc = t—k c k r 

where (c, fy) are electron operators (spin indices omitted) satisfying the Fermi anticommutation rules. 
They investigated the commutators among b and fy, which do not satisfy the usual Bose commutation 
rules. On the basis of these commutators and b k 2 = 0, BCS did not consider the bosonic nature of the 
pairons. But the eigenvalues for n 12 = C]fy c k2 c k = c," 1 ' c 2 ^ c 2 ^ in the pair states (k l5 k 2 ) are 
limited to 0 or 1 (fermion property), while the eigenvalues of the total pair number operator 

"o = Y] 

(7.93) t 

have no upper limit (bosonic property): 

17.941 wo-0,1,2,... 

The proof of 17.941 is given in Appendix A.4. Both fermionic and bosonic natures of the pairons 
must be used in the total description of superconductivity. 

The most important signature of many bosons is the Bose-Einstein Condensation. Earlier we 
showed that the pairon moves with the linear dispersion relation:^ 

17.951 wp = wo + cp , 

where we designated the pairon net momentum by the more familiar p rather than q. 

Let us consider a 2D system of free bosons having a linear dispersion relation: s = cp, c = 2/7iv F . 
The total number of bosons, N, and the Bose distribution function, 

(7.96) " e«2]-l " *< s| °»• 

are related by 

* = E hM't) = N * + > 

17.971 F 
where 

(7.98) 

is the number of zero-momentum bosons. The prime on the s umm ation in 17.971 indicates the omission 
of the zero-momentum state. We note that a is defined by a = pp, where p is the chemical potential 
and p = l/(k B T). For notational convenience we write 
2 

(7 99) E = w (> °) ■ 

We divide 17.971 by the normalization area L 2 , and take the bulk limit: 

(7.100) N ^ oo, L—► oo while NL~ 2 = n . 

We then obtain 
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e-n>Q 
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17.1011 

where ly, = N 0 /L 2 is the number density of zero-momentum bosons and n the total boson density. After 











performing the angular integration and changing integration variables, we obtain from (7.101! 
(Problem 7.9.1) 


(7.102> 


2x1i 2 c 2 f} 2 (n- n 0 ) = j y - (x =Pe, A = e* 1 ) 


The fiigacity X is less than unity for all temperatures. After expanding the integrand in (7.102! in 
powers of A,e -X (< 1), and carrying out the x-integration, we obtain 

(7.103! -w > 

*»<*>-££• ^^^ - 

(7.104) 

We need cj) 2 (2e) here, but we introduced <|) m for later reference. Equation (7.103! gives a relation 
among X, n, and T. 

The function <J) 2 (X,) monotonically increases from zero to the maximum value (t) 2 (l) = 1.645 as X is 
raised from zero to one. In the low-temperature limit, X = 1, 4> 2 (^) = c|> 2 ( 1) = 1.645, and the density of 
excited bosons, n x , varies like T 2 as seen in (7.103! . This temperature behavior of n x persists as long 
as the rhs of (7.103! is smaller than n; the critical temperature T c occurs at n = k B 2 T c 2 (j) 2 (l)/27rh 2 e 2 . 
Solving this, we obtain 

(7.105) *bT c = 1.954W /2 (= 1.24ftVFtt vz ) . 

If the temperature is raised beyond T c , the density of zero-momentum bosons, no, becomes 
vanishingly small, and the fiigacity X can be determined from 

„ * b tV 2 <A> T 
(7.106! ’ c - 

In summary, the fiigacity X is equal to unity in the condensed region: T < T c , and it becomes smaller 
than unity for T > T c , where its value is determined from (7.1061 . 

The internal energy density u, that is, the thermal average of the system energy per unit area, is 
given by 

n = —L_ /d 2 pe/ B {£). 

(7.107! J 

This u can be calculated in a similar manner. We obtain (Problem 7.9.2) 

u frw 2ni UMi 

(7.108! «ft 2 ^> b t c 2 4M1>‘ 

The molar heat capacity at constant density (volume), C n , is given by 


(7.109) 


C = Q = 


n) 
dT 


where R is the gas constant. The partial derivative du/dt maybe calculated through (Problem 7.9.3) 


3»{T,rc) 3w(T,A) ( 3u(r p A)3A(7» 

9T “ 3T + 9A 3T 

3i*<T, A) 0w{T, A) A )fBT 


(7.11Q! “ dT 3A 3n(T p A)/3A ' 

All quantities (n, u, C) can now be expressed in terms of (j) m (7). After straightforward calculations, 
the molar heat capacity C is given by (Problem 7.9.4) 
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In the condensed region T < T c , the fugacity X is unity. We observe that as X 


oo 

01£> _1 = w , 02 ^)-^ 02 ( 1 ) = 1 . 645 , 

1 


(7.112) 0j(A) -> 0j(l) = 1.202 , 0 + a)^0 + {l} = 1082 . 
Using these, we obtain from (7.1081 and (7.1111 

0j(b u 


(7.H3) u=2nka m?r 


C = 6bM1I 
(7.114) Wi) 


(ff’ 


(T < T c ). 


Observe that the molar capacity C grows like T 2 . Also note that the molar heat capacity C at T c is 


given by 
(7.115 

For T > T c , the temperature dependence of X, given by Equation (7.1061 . is quite complicated. We 

can numerically solve (7.1111 for X with a computer, and substitute the solution in (7.1021 to obtain 
the temperature behavior of C. The result is shown in Figure 7.23 . Equations (7.1101 and (7.1111 
allow us not only to examine the analytical behavior of C near T = T c but also to obtain C without 

numerically computing the derivative <9u(T, n)/ST. 


Figure 7.23 The molar heat capacity C for 2D massless bosons rises like T * 2, reaches 4.38 R at T c , 

and then decreases to 2 R in the high-temperature limit. 

CfR 



In summary, the molar heat capacity C for a 2D massless boson rises like T 2 in the condensed 
region, reaches 4.38 R at T = T c , and then decreases to the high-temperature limit value 2 R. The heat 

capacity changes continuously at T = T c , but its temperature derivative <9C(T, n)/cT jumps at this 
point. The order of phase transition is defined to be that order of the derivative of the free energy F 
whose discontinuity appears for the first time. Since C v = T(SS/dT) v = - T(<3 2 F/<9T 2 ), <9C V /<9T = 























-T(d 3 F/ST 3 ) - (d 2 F/ST 2 ), the BEC is a third-order phase transition. Note that the temperature 
behavior of the heat capacity C resembles that observed in YBCO shown in Figure 7.12 . The 
condensation of massless bosons in 2D is noteworthy. This is not a violation of Hohenberg’s theorem 
[31] that there can be no long range order in 2D, which is derived with the assumption of an f-sum 
rule representing the mass conservation law. In fact no BEC occurs in 2D for finite-mass bosons. 
Problem 7.9.1. Verify (7.102) . 

Problem 7.9.2. Verify (7.108) . 

Problem 7.9.3. Prove (7.110T 
Problem 7.9.4. Verify 17.1111 . 

7.10 Discussion 

The idea that the superconductivity is a manifestation of the BEC has long been suspected. The 
superconductivity in a metal and the superconductivity in liquid helium have many similarities. Both 
involve dissipationless flows, and they occur at very low temperatures. In particular, F. London 
treated superconductivity and superfluidity from the BEC point of view in his two-volume book [32]. 
The BEC temperature T c in D dimensions can be found from 
_Wo_i f, D i 

(7.1161 H<> {2nh} D ] P exp{f/{k B TS-i ’ * CP ' 

Using (7.431 and (7.441 along with (7.1071 . we obtain (Problem 7.10.1) 

T = j 1.01ft (3D) 

(7.1171 ' (l.24AvtVC (2D). 

The 2D BEC is noteworthy since the BEC of massive bosons (s = p 2 /(2m)) is known to occur in 3D 
only. The interpairon distance r 0 computed from (7.1171 is 

Uo VJ = l-Glfi^Mr 1 (3D) 

(7.1181 ^ I "o VZ = T c r 11 (2D) . 

The zero-temperature BCS pairon size [19] is given by 

(7.119) fr B T c 

From the last two equations we obtain 
n. = {5.6 (ID) 

(7.120) ! 6 - 9 < 2D >- 

indicating that the condensed pairons do not overlap in space. Hence, the free pairon model can be 
used to evaluate T c . 

The similarity in 2D and 3D BEC is most remarkable. In particular the critical temperature T c 
depends on (v F , r 0 ) nearly in the same manner. Now, the interpairon distance r 0 is different by the 
factor 10 2 ~ 10 3 between 3D and 2D superconductors. The Fermi velocity v F is different by the factor 

10 ~ 10 2 . Hence, the high critical temperature in 2D superconductors is explained by the very short 
interpairon distance, partially compensated by a smaller Fermi velocity. 

















We stress that formulas (7.117t for the critical temperatures are distinct from the famous BCS 
formula (in the weak coupling limit): 

tsTc = 1.136tyo exp { —^— 1 , 

( 7 . 121 ) e \v«v{Q)) 

where co D is the Debye frequency, v 0 the pairing strength, and p(0) the DOS per spin at the Fermi 
energy. 

The pairon density ty, and the Fermi velocity v F appearing in formulas (7.1171 can be determined 

experimentally from the data of the resistivity, the Hall coefficient, the Hall angle, the specific heat, 
and the superconducting temperature. 

The linear dispersion relation can be probed by using Angle-Resolved Photoemission 
Spectroscopy (ARPES). Lanzara el al. [33] studied the dispersions in three different families of hole- 
doped copper oxides: Bi 2 Sr 2 CaCu 2 0 8 (Bi2212), Pb-doped Bi 2 Sr 2 Cu 6 (Pb-Bi2201), and La 2 _ 

x Sr x Cu0 4 (LSCO). A summary of the data, reproduced from [33, Figure 1], is shown in Figure 7.24 . 

The energy is measured downwards and the reduced momentum k is in the abscissa. See more 
detailed specification in the original reference. The data in Figure 7.24a.b are in the superconducting 
states while those in Figure 7.24c are in the normal state. Note that in all three cases the dispersion 
relation is linear for low k and quadratic for high k. The phonon energy has an upper limit of the order 
hro D (Debye energy) and hence, the quasiparticle (pairon) mediated by the phonon exchange must 

have a finite energy. The change of the slopes, indicated by thick arrows, occurs around 50-80 meY 
which are distinct from the superconducting energy gaps (10~50meV). The energies 50-80 meV 
appear to correspond to the energy of the in-plane oxygen-stretching (breathing) longitudinal optical 
phonon. 

Figure 7.24 The quasiparticle dispersion relations derived from the momentum distribution curves 
along (0, 0 )-(jt, 7i) for (a) LSCO at 20 K, (b) Bi2212 at 20 K. Both materials are in superconducting 
states with the doping 8 indicated, (c) The dispersion relation for Bi2201 at 30 K (normal state). The 
temperature dependence of the dispersion relation for LSCO at 8 = 0.15 (d) and Bi2212 at 8 = 0.16 
(e). Note that the energy scale is measured downward. The arrows indicate slope changes in the 
curves. 

After Lanzara et. aL [33] 










(d) h- (e) h- 


Figure 7.24d.e indicate that the dispersion relations do not change above and below T c for LSCO 

and Bi2212, respectively. The pairons have a linear dispersion relation with the same slope both 
below and above T c . Thus, the ARPES fully supports our BEC picture of superconductivity. We 

stress that the pairons do not break up at T c as thought in the original BCS theory. 

Pairons can multiply occupy the same CM momentum state. They move freely as bosons via a linear 
dispersion relation: s = cp. The system of free pairons (bosons) undergoes a BEC transition of the 
second (third) order in 3(2)D with the critical temperature 

T c = l.Ol&i*( 1 - 24 * 1 * ( 

where ly, is the pairon density. In general the critical temperature T c for free bosons moving in D 

dimensions can be found from(7JT6). The solutions for D = 2 and 3 are given in (7.1181 . For D = 1, 
(7.1161 has no solution (Problem 7.10.2). In other words, there is no BEC in ID. To see this, let us 
take a dispersion relation: 

( 7 . 122 ) £= ap a , 

where a and a are constants. If we substitute this s in (7.116) . we can find a solution if a < 1. The 
index a, however, must be greater than, or equal to, unity; otherwise the boson has an infinite speed at 
zero momentum 

Problem 7.10.1. Derive (7.1171 . 

Problem 7.10.2. Prove that (7.1161 has no solution (that is, no T c ) for D = 1. 


References 

1 Kammerlingh Onnes, H. (1991) Acad. Wetenschappen, Amsterdam, 14, 113. 

2 Meissner, W. and Ochsenfeld, R. (1933) Naturwiss, 21, 787. 

3 File, J. and Mills, RG. (1933) Phys. Rev. Lett., 10, 93. 





















4 Deaver, B.S. and Fairbank, W.M. (1961) Phys. Rev. Lett., 7, 43. 

5 Doll, R. and Nobauer, M. (1961) Phys. Rev. Lett., 7, 51. 

6 Josephson, B.D. (1962) Phys. Lett., 1, 231. 

7 Anderson, P.W. and Lowell, J.M. (1963) Phys. Rev. Lett., 10, 486. 

8 Jaklevie, R.C., Lambe, J., Marcereaw, J.E., and Silver, A.H. (1965) Phy. Rev. A, 140, 1628. 

9 Glover, R.E. HI and Tinkham, M. (1967) Phys. Rev., 108, 243. 

10 Biondi, M.A., and Garfunkelm, M. (1959) Phys. Rev., 116, 853. 

11 Giaever, I. (1960) Phys. Rev. Lett., 5, 147, 464. 

12 Phillips, N.E. (1959) Phys. Rev., 114, 676. 

13 Giaever, I. and Megerle, K. (1961) Phys. Rev., 122, 1101. 

14 Loram, I.W., Mirza, K.A., Cooper, J.R., Liang, W.Y., and Wade, J.M. (1994) J. Superconduct., 7, 
347. 

15 Ginzburg, YL. and Landau, L.D. (1950) J. Exp. Theor. Phys. (JETP) (USSR), 20, 1064. 

16 Bednorz, VG. and Muller, K.A. (1986) Z. Phys. B, 64, 189. 

17 Wu, M.K., Ashburn, J.R., Torng, C.J., Hor, P.H., Meng, R.L, Gao, L., Huang, Z.J., Wang, Y.Q., and 
Chu, C.W. (1987) Phys. Rev. Lett, 58, 908. 

18 Cooper, L.N. (1956) Phys. Rev., 104, 1189. 

19 Bardeen, J., Cooper, L.N. and Schrieffer, J.R. (1957) Phys. Rev. 108, 1175. 

20 Landau, L.D. (1956) J. Exp. Theor. Phys. (JETP), USSR, 30, 1058. 

21 Landau, L.D. (1957) J. Exp. Theor. Phys. (JETP), USSR, 32, 59. 

22 Fujita, S. (1991) J. Superconduct., 4, 297. 

23 Fujita, S. (1992) J. Superconduct., 5, 83. 

24 Fujita, S. and Watanabe, S. (1992) J. Superconduct., 5, 219. 

25 Fujita, S. and Watanabe, S. (1993) J. Superconduct., 6, 75. 

26 Fujita, S. and Godoy, S. (1993) J. Superconduct., 6, 373. 

27 Schrieffer, J.R. (1964) Theory of Superconductuctivity, Benjamin, New York. 

28 Gortor, C.J. and Casimir, H.G.B. (1934) Z. Phys., 35, 963. 

29 Gortor, C.J. and Casimir, H.G.B. (1934) Z. Tech. Phys., 15, 539. 

30 Fujita, S. and Ito, K. (2007) Quantum Theory of Conducting Matter, Springer, New York. 

31 Hohenberg, P.C. (1967) Phys. Rev., 158, 383. 

32 London, F. (1964) Superfluids, vol. I and II, Dover, New York. 

33 Lanzara, A., Bogdanob, P.V, Zhou, X.J., Keller, S.A., Feng, D.L., Lu, E.D., Yoshida, T., Eisaki, H., 
Fujimori, A., Kishio, K., Shimoyama, J.-I., Noda, T., Uchida, S., Hussain, Z., and Shen, Z.-X. (2001) 
Nature, 412, 510. 


1} In the text, quasielectrons (quasiholes) are distinctly denoted by “electrons” (“holes”) 
possessing charge (magnitude) e that circulate counterclockwise (clockwise) viewed from the tip 


of the applied magnetic field vector B. 


2) The second-quantized operators for a pair of “electrons” (1) and a pair of “holes” (2), called 
respectively the -pairon and the +pairon operators, are denoted by B’s as defined in 17.14) , where 

denotes the creation (annihilation) operator for “electron” (“hole”), j = 1(2), 
satisfying the Fermi anticommutation rules: 

= = 0 , 

where s represents up (down)-spin \ Q). 

3) Vy denotes the interaction strengths arising from the phonon-exchange attraction (that is, the 

virtual exchange of a phonon and the Coulomb repulsion between “electrons”). The pairing 
interaction strength Vjj (= y^/V) in the generalized BCS Hamiltonian 17.23) is reduced to v 0 (= 

V 0 /V) in the BCS Hamiltonian 17.5) when the spherical Fermi surface is assumed. Note that y,j 

should appear when the theory incorporates the band structures of electrons explicitly. The theory 
can be applied to (3, 2)D systems in a similar way. 

4) The density of states in the momentum k-space for D dimensions, ® <D) (k), is generally defined 
through 

where L and g (= 2s + 1) denote the normalization length and the spin degeneracy factor, 
respectively. This formula is valid in the bulk limit for any shape of a box L D . For electrons, s = 
1/2 and g is given by g = 2. 

5) The linear dispersion relation of the form w q = w 0 + cq, is given with the assumption of a Fermi 

sphere, a Fermi circle, and a Fermi bar for 3, 2, and ID, respectively. Here the speed c is given in 
terms of the Fermi speed v F by c/v F = 1/2, 2/n and 1 for 3, 2 and ID, respectively. 

6) The reduced Hamiltonian ^ red is reduced from the generalized BCS Hamiltonian (7.23) . where 
the pairon operators, B’s, are replaced in the following way: 

✓a ,(7) _ ( z) m = . (Zj 
^ — —kl ~ * 

since at 0 K, there are no phonons excited, meaning that the phonon momenta q = 0. Then, the pair 
annihilation (creation) operators for “electrons” (1) and “holes” (2) are respectively expressed in 
terms of c’s (c^’s). 

7) The restrictions for the summation symbols in 17.45) are given by 0 < = s k < hco D for 

“electrons” and 0 < s k ^ = |sk| < hoo D for “holes”, respectively. 


8) See 17.43) and 17.44) . The coefficient c in 17.95) is specifically given by (2/7i)v F for a 2D 
system and (l/2)v F for a 3D system, respectively. 











Chapter 8 


Metallic (or Superconducting) SWNTs 


A metallic (semiconducting) single-wall nanotube (SWNT) contains an irrational (integral) number of 
carbon hexagons in the pitch. The room-temperature conductivity is higher by two to three orders of 
magnitude in metallic nanotubes than in semiconducting nanotubes. Tans el al. [5] measured the 
electric currents in metallic single-wall carbon nanotubes under bias and gate voltages, and observed 
various non-Ohmic behaviors. The original authors interpreted their data in terms of a ballistic 
electron transport due to the Coulomb blockage on the electron carrier model. The mystery as to why 
a ballistic electron is not scattered by inpurities and phonons is unexplained, however. An alternate 
interpretation is presented based on the Cooper pair (pairon) carrier model. Superconducting states 
are generated by the Bose-Einstein condensation (BEC) of the ±pairons at momenta 27ihn/L, where L 
is the tube length and n a small integer. As the gate voltage changes the charge state of the tube, the 
superconducting states jump between different n. The normal current peak shapes appearing in the 
transition are temperature-dependent, which is shown to be caused by the electron-optical phonon 
interaction. 


8.1 Introduction 

Single-wall nanotubes (SWNTs) can be produced by rolling graphene sheets into circular cylinders of 
about 1 nm in diameter and microns (pm) in length [1, 2]. Electrical conduction in SWNTs depends 
on the pitch [3, 4] and can be classified into two groups: semiconducting or metallic [5, 6]. In our 
previous work [7] we have shown that this division into two groups arises as follows. A SWNT is 
likely to have an integral number of carbon hexagons around the circumference. If each pitch contains 
an integral number of hexagons, then the system is periodic along the tube axis, and “holes” (and not 
“electrons”) can move along the tube. The system is semiconducting and its conduction is 
characterized by an activation energy s 3 . The energy e 3 has a distribution since both pitch and 

circumference have distributions. The pitch angle is not controlled in the fabrication process. There 
are numerous other cases where the pitch contains an irrational number of hexagons. In these cases the 
system shows a metallic behavior experimentally [5]. We primarily deal with these so-called metallic 
SWNTs in this chapter. 

Tans et al. [5] measured the electric currents in metallic SWNTs under bias and gate voltages. Their 
data from [5, Figure 2], are reproduced in Figure 8.1 . The currents versus the bias voltage are plotted 
at three gate voltages: A (88.2 mV), B (104.1 mV), and C (120.0 mV) (shown in Figure 8.1 ). Some 
significant features are: 

a) A non-Ohmic behavior is observed for all, that is, the currents are not proportional to the bias 

voltage except for high voltages. The gate voltage charges the tube. The Coulomb (charging) 




energy of the charge Q of the system is represented by 
E 

(8.1) ^ “ 2C f 

where C is the total capacitance of the tube. 

b) The current near the origin appears to be constant for different gate voltages V gate , traces A-C. 

This feature was confirmed by later experiments [8, 9]. The current does not change for small 
varying gate voltage in a metallic SWNT while the current (magnitude) decreases in a 
semiconducting SWNT. 

c) The current at gate voltage V gate = 88.2 mV (trace A) reverts to the normal resistive behavior 

after passing the critical bias voltages on both (positive and negative) sides. Similar behaviors 
are observed for V gate = 104.1 mV (trace B) and V gate = 120.0 mV (trace C). 

d) The flat current is destroyed for higher bias voltages (magnitude). The critical bias voltage 
becomes smaller for higher gate voltages. 

e) There is a restricted V gate -range (view window in [5, Figure 3]) in which the horizontal stretch 
can be observed. 

Figure 8.1 Current-voltage curves for nanotubes at gate voltages of 88.2 mV (trace A), 104.1 mV 
(trace B) and 120.0 mV (trace C) at T = 5 mK. 

(After Tans et at in [5, Figure 2].) 



Bias voltage (mV) 

The original authors interpreted the flat currents near V bias = 0 in Figure 8.1 in terms of ballistic 
electron transport based on the electron carrier model. 

We propose a different interpretation of the data in Figure 8.1 based on the pairon (Cooper pair) 
carrier model. Pairons move as bosons, and hence they are generated with no activation energy factor 
(the main characteristic of metallic conduction). All features a)-e) listed above can be explained 
simply by the assumption that the tube is in the superconducting state as explained below. 

The supercurrent runs without obeying Ohm’s law. This explains feature a) above. The 
supercurrents can run without resistance due to phonon and inpurity scattering and with no bias 
voltage. Bachtold et al. [10] observed by scanned probe microscopy that the currents run with no 
voltage change along the tube in metallic SWNTs. They also concluded that the currents run 
ballistically. 







We shall show later that the system may be in a superconducting ground state, whose energy —E g is 

negative relative to the ground state energy of the Fermi liquid (electron) state. If the total energy E of 
the system is less than the condensation energy E g : 

f8,2f ^ = K + Ecod + Ep < E g , 

where K is the kinetic energy of the conduction electrons and the pairons, and 
(8.3) 

is the Coulomb field energy, then the system is stable. The experiments shown in Figure 8.1 were 
done at 5 mK. We drop the kinetic energy K hereafter. The superconducting state is maintained and the 
currents run unchanged if the bias voltage V bias is not too large so that the inequality (8.2) holds. This 

explains the horizontal stretch feature b). 

If the bias voltage is high enough so that the inequality symbol in (8.2) is reversed, then normal 
currents revert and exhibit Ohmic behavior, which explains feature c). 

Feature d) can be explained as follows. For higher V gate there is more charge and hence the charges 
Q a , Qb, Qc f° r three cases (A, B, C in Figure 8.1 1 satisfy the inequalities: 


f8.4I Qa < Qb < Qc ■ 

The horizontal stretches are longer for smaller bias voltages. At the end of the stretch (V biasjnax ) the 
system energy equals the condensation energy E g . We then obtain from 18.21 . after dropping the kinetic 
energy K, 


E$ j jnax — Q^iaE^nax = Q.^in 


(8.5) -Eg-Ecw-Eg-—. 


Using the inequalities f8.4) . we then obtain 

(8.6) ^A,max ** Tnax ^ ^C/naK r 

which explains feature d) in the list above. 

The horizontal stretch becomes shorter as the gate voltage V gate is raised; it vanishes when V gate is 
a little over 120.0 mV The limit is given by 


(8.7) 


Ep.iriax — Eg Eco U ] — F s ^ ~ — 0 . 


g_ 

2 C 


If the charging energy E Coul exceeds the condensation energy E g , then there are no more supercurrents, 
which explains feature e). 

Clearly, the important physical property in our pairon model is the condensation energy E g . 

In the currently prevailing theory it is argued that the electron (fermion) becomes ballistic at a 
certain quantum condition. But all fermions are known to be subject to scattering. It is difficult to 
justify why the ballistic electron is not scattered by inpurities and phonons, which naturally exist in 
nanotubes. In contrast our theory is straightforward. We argue that the super currents, as is known, can 
run with no resistance (due to inpurities and phonons). 

To test the validity of the present theory we simply need to find the superconducting temperature by 
performing the conductivity measurements at higher temperatures. 

Tans et al. [8, 11] noted that the measured currents are bistable at 0.1 and 0 nA. In our interpretation 










these currents are simply the supercurrents that arise from the Bose-condensed states at a finite 
momentum and zero momentum 

In Figure 8.2 . current versus gate voltage at V bias = 30 pV after Tans et al. [5] is shown. The delta 

function-like peaks are quasiperiodic. The flat currents near the origin are caused by the 
supercurrents due to the condensed pairons at a microscopic momentum 


Figure 8.2 Current vs. gate voltage at V bias = 30 pV 


(After Tans et al. in (5, Figure 2].) 



Gate voltage (mV) 


2 xft 

( 8 . 8 ) p = ~r' 

where L is the tube length. The peaks arise from the motion of noncondensed pairons. As the gate 
voltage is varied, the charge density changes, which makes the superconducting states jump among the 
different momenta 


2nfin 

t8.9I Pn ~T~ ’ 

where n is a small integer. In the transitions, the normal currents due to noncondensed pairons appear. 
Tans et al. [5, 8] observed that the conductance G versus the gate voltage V gate has peaks that are 

temperature-dependent as shown in Figure 8.3 . reproduced from [5, Figure 3]. These authors obtained 
good fits with the assumption of the peak formula. 


Figure 8.3 Conductance G = I/V bias versus AV gate at low bias voltage V bias = 10 pV and different 
temperatures. Solid lines are fits of G °c cosh -2 (eAV gate /(a2k B T)), where AV gate is the gate voltage 
difference. The factor a converts AV gate into the corresponding electrostatic potential shift of the tube. 

(After Tans et aL in [5, Figure 3]). 
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where factor a converts the gate voltage shift AV gate into the corresponding electrostatic potential 
shift of the tube. See [5] for the definitions of AV gate and a. We may re-interpret the data as follows. 

The carriers in the normal currents (peaks) observed are noncondensed pairons, each having 
charges, +2e or —2e, which are subject to phonon scattering. 

The phonon scattering rate T is calculated by the standard formula: 


(8.11) T — Mphonon ^paironA , 

where v pairon is the pairon speed and A the pairon-phonon scattering cross section, both of which are 
temperature-independent. We assume that optical phonons at the energy s 0 with no dispersion are the 
predominant scatterers. Then, the phonon density n pbonon is given by the Planck distribution function: 



At the experimental temperatures (110, 390, and 830 mK) the Planck distribution function can be 
approximated by the Boltzmann distribution function; 



In the experimental low-temperature range (110-830 mK) the peak-shape function f8.10t can be 
approximated as 


(8.14) 


cosh ^ =» 4exp(--^%) . 

V a2k K T) b V ) 


This has the same exponential form as formula (M3), suggesting that the cause of the temperature- 
dependence is phonon scattering. This interpretation was also adopted by Saito, Dresselhaus, and 
Dresselhaus [6]. 

In the inset of Figure 8.3 . the current versus the gate voltage is plotted at bias voltage V bias = 0.4, 
0.8, 1.2, and 1.6 mV At V bias =1.6 my we see three steps (plateaus) separated by about 0.04 nA. We 
interpret these as three superconducting channels carried by the pairons condensed at three momenta. 





















The lowest step is formed between V gate =115 and 140 mV and has the current value 0.04 nA. This 
step is also formed similarly at V bias = 0.4, 0.8, and 1.2 mV The second lowest step is formed 
between V gate =115 and 132 my and the third (highest) step is formed between V gate =118 and 128 

mV The supercurrents are additive. We predict that if the bias voltage is raised, then there should be 
more steps. 

The center of mass (CM) of pairons move as bosons and their currents can be generated without 
activation. There are ±pairons, each having charge ±2e. The energy of the moving pairon is given by 

2 (jj 

(8.i5) 

where v F ®, (j = 1 (electron), 2 (hole)), is the Fermi speed, which will be derived later in Section 8.4. 
Since the momentum (magnitude) p n , see t8.9k is extremely small, the pairons in superconducting 
channels have negligible energies. 

If the SWNT is unrolled, then we have a graphene sheet, which can be superconducting at a finite 
temperature. We first study the conductivity of graphene in Section 8.2, starting with the honeycomb 
lattice structure and introducing “electrons” and “holes” based on the Cartesian unit cell as distinct 
from the Wigner-Seitz (WS) unit cell (rhombus). Phonons are generated based on the same Cartesian 
unit cell. In Section 8.3 we construct a Hamiltonian suitable for the formation of the Cooper pairs. A 
linear energy-momentum (or dispersion) relation is derived in Section 8.4. The pairons moving with 
a linear dispersion relation in two dimensions undergo a Bose-Einstein condensation (BEC), which is 
shown in Section 8.5. Superconductivity in metallic SWNTs is discussed in Section 8.6. A summary 
is given in the last section. 

8.2 Graphene 

Before dealing with a metallic SWNT, let us consider graphene, in which carbon ions (C + ) occupy a 
two-dimensional (2D) honeycomb crystal lattice. The normal carriers in the transport of electrical 
charge are “electrons” and “holes.” The “electron” (“hole”) is a quasielectron that has an energy 
higher (lower) than the Fermi energy s F and those electrons (holes) are excited on the positive 

(negative) side of the Fermi surface with the convention that the positive normal vector at the surface 
points in the energy-increasing direction. The applied gate voltage can control the carrier charge type, 
j = 1, 2, electron (1) or hole (2), and the number density nj. The “electron” (“hole”) wavepacket is 

assumed to have a charge -e (+e) and has the size of a unit hexagon formed by positively charged 
carbon ions C + . The positively charged “hole” tends to stay away from the C + ions. The center of 
mass of the “hole” (wave packet) is at the center of the hexagon and hence its charge is concentrated 
at the center of the hexagon. It moves easily with a small effective mass along the directions {100c- 
axis) = {100), where we used the conventional Miller indices for the hexagonal lattice with the 
omission of the c-axis index. The electron has a negative charge (-e) and a unit hexagon size. The 
negatively charged electron tends to stay close to the C + ions and the charge distribution is more 
concentrated near the C + hexagon. The CM of the “electrons” (wavepacket) is also at the center of the 
hexagon. It moves easily with a small effective mass along the directions (110), see Figure 8.4 and 
below. 




Figure 8.4 A rectangular (dotted line) unit cell of graphene. An open circle (O) indicates the positive 
ions, C + . 


[ 110 ] 



For a description of the electron motion in terms of the mass tensor, it is convenient to introduce 
Cartesian coordinates, which do not necessarily match the crystal’s natural (triangular) axes. The 
“electron” (wavepacket) may move up or down in (110) to the neighboring hexagon sites passing 
over one C + . The positively charged C + acts as a welcoming (favorable) potential valley for the 
negatively charged “electron”, while the same C + acts as a hindering potential hill for the positively 
charged “hole”. The “hole”, however, can move on a series of vacant sites in [001], each surrounded 
by six C + , thus never meeting the hindering potential hills. Thus, the easy channel directions for the 
electrons and holes are {110) and {100), respectively. 

We may choose a unit cell with side-length pair (b, c) as shown in Figure 8.4 . Then the Brillouin 
zone in the k -space is unique: a rectangle with side lengths (27i/b, 2tc/c). We note that the lattice has 
inversion (mirror) symmetry with respect to the x- and y-axis. 

Let us consider the system (graphene) at 0 K. If we put an electron in the crystal, then the electron 
should occupy the center O of the Brillouin zone, where the lowest energy lies. Additional electrons 
occupy the points neighboring O in consideration of Pauli’s exclusion principle. The electron 
distribution is lattice periodic over the entire crystal in accordance with the Bloch theorem 

Graphene is a quadrivalent metal. The first few low-lying bands are completely filled. The 
uppermost partially filled bands are important for discussion of the transport properties. We consider 
such a band. The Fermi surface, which defines the boundary between the filled and unfilled k-spaces 
(area) is not a circle since the x-y symmetry is broken. The electron effective mass is lighter in the 
direction [110] than perpendicular to it. Hence, the electron motion is intrinsically anisotropic. If the 
electron number is raised by the gate voltage, then the Fermi surface should grow more quickly in this 
direction with increasing number of electrons. Because of the inversion symmetry of the crystal the 
Fermi surface must approach perpendicular to the Brillouin boundary. As the gate voltage changed to 
the charge-neutral point, the Fermi surface should go through a “neck” configuration, where the 
density of states rapidly grows on both sides of the voltage, generating high densities of “electrons” 
and “holes.” Experiments [12] indicate that (a) both electrons and holes are excited in graphene, (b) 
at zero gate voltage, the electrons are dominant, and (c) the resistivity p exhibits a sharp maximum at 
the electron density n 2 - 2 x 10 11 cm 2 . Feature (b) should arise as follows. The negatively charged 

electron is close to the positive ions C + and the hole is farther away from C + . Hence, the gain in the 



















Coulomb interaction is greater for the “electron.” That is, the “electron” is more easily activated. 
Thus, the “electrons” are the majority carriers at zero gate voltage as observed. Feature (c) is related 
to the fact that the conductivity 

_i ne 2 z 

(8.16) ° = P m* J 

where n is the carrier density and t is the relaxation time, must decrease since the effective mass m* 
shoots up to oo in the small neck limit. 

We note that the “neck” Fermi surface was observed in copper, where densities of “electrons” and 
“holes” are high [13]. Necks in the Fermi surface are known to cause the Hall (Seebeck) coefficient 
to be negative (positive) [14]. It is interesting to see if graphene shows the same properties. The same 
easy channels in which the electron runs with a small mass, may be assumed for other hexagonal 
directions, [Oil] and [101]. The currents run in three channels (110) = [110], [Oil], and [101] and 
thus the system as a whole does not show anisotropy in transport properties. 

The current band theory based on the WS cell model predicts a gapless semiconductor and cannot 
explain this fact. The WS model is suited for the study of the ground state energy of a crystal. To treat 
the electron dynamics for a noncubic lattice such as a honeycomb lattice, we must introduce a 
Cartesian unit cell and use a mass tensor. We present a new theoretical model for electron dynamics. 
In this model the electron (hole) has a size of a unit carbon hexagon. 

We have seen that the “electron” and “hole” have different internal charge distributions, and they 
are not point particles. Hence, they have different effective masses nq and m 2 , which are different 

from the gravitational mass m= 9.11 x 10 -28 g. Which carriers are easier to activate or excite? This 
question can be answered without considering channeling. The electron is near the positive ions and 
the hole is farther away from the positive ions. Hence, the gain in the Coulomb interaction is greater 
for the electron. That is, the electrons are more easily activated (or excited). The electrons move in 
the welcoming potential well channels while the holes do not. This fact also leads to the smaller 
activation energy for the electrons. We may represent the activation energy difference by [15] 

(8.17) < n ■ 

The thermally activated (or excited) electron densities are given by 

(8i8)n J m = » J e ip (0). 

where j = 1 and 2 represent the electron and hole, respectively. The prefactor nj is the density at the 

high-temperature limit. The “electron” can move easily with a smaller effective mass in the direction { 
110) rather than perpendicular to it as we see presently. 


8.3 The Full Hamiltonian 

Fujita and his group developed a quantum statistical theory of superconductivity in a series of papers 
[16-20]. We present this theory (see Section 8.4) here again in the present section. 

In the ground state there are no currents for any system To describe a supercurrent, we must 
introduce moving pairons, that is, pairons with finite CM momenta. Creation operators for “electron” 
(1) and “hole” (2) pairons are defined by 



n <i)+ _ B mt . R m _ J2) (2> 

(8.19) = B tinii = c i c i • — c 4 H • 

We calculate the commutators among B and B ; , and obtain 

(8.20) [ B «- B «] = 0 ■ [ s ii’] i = 0 ’ 


[ 


Tjti) 


( 8 . 21 ) 


r U) r U)t 
c 2 c 4 

r (j) r t; It 

t, tj 

0 


if k\ = k j and fci = fc* 
if k, = kj and k 2 ^ fc* 
if ki ^ kj and kj = k 4 
otherwise. 


Pairon operators of different types j always commute: 

(8.22) [B 01 ' B(i) ] = 0 1f 


(8.23) B l j ' = ' 


««> = cUltcUl, 

2 hi l 


represent the number operators for “electrons” (j = 1) and “holes” (j = 2). 
Let us now introduce the relative and net momenta (k, q) such that 


(8.24) ts I (t, - t!) 


q = h + k 2 \ fci = k + y<|, k 2 = -k + j(J 


Alternatively we can represent pairon annihilation operators by 

R '|i| - nm - ji) r ro R 'W _ r m m 

(8.25) *9 ~ lltlzl “ ~ ^+9/21- -k+tfU ■ 

The prime on B will be dropped hereafter. In the k-q representation the commutation relations are re¬ 
expressed as 

[<Bi?,] = 0, = 0 . 

[ d Li) B Plt1_ 

18.26) LV^vJ- 

( (1 - + - W-Jt+9/24.) <5ji 

c <j) c <j)t d 

C -k+qjU C - t ' + <fj2l °J ‘ 


( 827 ) 


-<j) 


M) t 


C k+ q l2t C y+^flf & ji 
0 


if k — fc' and q — q' 
if k + J = k f + i 
and -k -1- f # — k' + i- 
ifk + f ^ k' + 4 
and -k 4- f = —fc" + £ 
otherwise. 


Using the new notation, we can rewrite the generalized Bardeen-Cooper-Schrieffer (BCS) 
Hamiltonian (7.23) as- 1 - 1 

k = E«+E« 

fe.S fe r 3 

-E'E'EM c <1+^ + <’«?,+^ -SB • 

(8.28) * * *' 

where v 0 is the pairing interaction strengths per volume among and between “electron” (1) and “hole” 

(2) pairons. This is the full Hamiltonian for the system of graphene, which can describe moving 
pairons as well as stationary pairons. Here, the prime on the summations indicates the restriction 
arising from the phonon exchange, see below. 










8.4 Moving Pairons 


The phonon-exchange attraction is in action for any pair of electrons near the Fermi surface. In 
general the bound pair has a net momentum, and hence, it moves. Such a pair is called a moving 
pairon. The energy w q of a moving pairon for the 2D case can be obtained (see Section 7.6) from 

»»»(*>*) = L e (| fe + f : ) + e ( - k + f|)]°(M) 


I'C 


r I d z k'a(k', q ), 


(8.29) 

which is Cooper’s equation. Equation (1) of his 1956 Physical Review Letter [21]. The prime on the 
k’-integral means the restriction on the integration domain arising from the phonon-exchange 
attraction, see below. We note that the net momentum q is a constant of motion, which arises from the 
fact that the phonon exchange is an internal process, and hence cannot change the net momentum. The 
pair wavefrmctions a(k, q) are coupled with respect to the other variable k, meaning that the exact (or 
energy-eigenstate) pairon wave functions are superpositions of the pair wavefrmctions a(k, q). 
Equation t8.29I can be solved as follows. We assume that the energy w q is negative: 

(8.30) < 0 ■ 

Then, 

Rearranging the terms in (8.29) and dividing by 

*(I*+!D + «(I- |+ !I) 

we obtain 


w a 


«(M) = [> (|* + §|) + <■ ( 

where 


—*+f|)—»?] 


-1 


C{q), 


«o 


(8.32) C<,) (2**) 2 


I d 2 k'a(k',q), 


which is k-independent. Introducing (8.31) in (8.29) . and dropping the common factor C(q), we 
obtain 


I = 


Vq 


■ I [,(|fc + ||) + £ (|- fc + !■) + Kl] 


(8.33) 

We now assume a free electron model in 3D. The Fermi surface is a sphere of the radius 
(momentum) 

(8.34) h = (2nt]£ F ) 1 / 2 , 

where m, represents the effective mass of an electron. The energy s(|k|) is given by 

(,35)^ — ^- 

The prime on the k-integral in t8.33I means the restriction: 

/i- ?i\ .( . a \ . 




















We may choose the polar axis along q as shown in Figure 8.5 . The integration with respect to the 
azimuthal angle simply yields the factor 2n. The k-integral can then be expressed by (Problem 8.4.1) 


Figure 8.5 The range of the interaction variables fk. 0") is limited to a circular shell of thickness k D . 
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(8.37) 0 

where k D is given by 



(8.38) ■ 

After performing the integration and taking the small-q and small-(k D /k F ) limits, we obtain (Problem 
8.4.2) 

rs.39) w i = m >+ 

where w 0 is given by (Problem 8.4.3) 


— Iflatn 

\l'-. = -:- 

t8.40f exp(2/(v c P(0))) -1 

As expected, the zero-momentum pairon has the lowest energy. The excitation energy is continuous 
with no energy gap. Equation 18.39f was first obtained by Cooper (unpublished) and it is recorded in 
Schrieffer’s book [22, Equation t2.151 ]. The energy w q increases linearly with momentum q(= |q|) for 

small q. This behavior arises from the fact that the density of states is strongly reduced with the 
increasing momentum q and dominates the q 2 increase of the kinetic energy. The linear dispersion 
relation means that a pairon moves like a massless particle with a common speed v F /2. This relation 

plays a vital role in the Bose-Einstein condensation (BEC) of pairons (see next section). 

Such a linear energy-momentum relation is valid for pairons moving in any dimension. However, 
the coefficients slightly depend on the dimensions; in fact 


t8.4U w^wo + cq, 

where c = l/2v F and 2/7iv F for three and two dimensions, respectively. 

Problem 8.4.1. Verify (837). 

Problem 8.4.2. Derive an energy-momentum relation f8.39f for the 3D case with the assumption of a 
Fermi sphere. Hint: Use the diagram in Figure 8.5 . 

Problem 8.4.3. Show that w 0 for the 3D case is given by f8.40k 


















8.5 The Bose-Einstein Condensation of Pairons 


In Section 8.3, we saw that the pair operators (B, B^) appearing in the full Hamiltonian w in 18.281 
satisfy rather complicated commutator relations 18.261 and 18.271 . In particular part of 18.261 

(8.42) [ B *°] = M - [ c -M c u] = ° 

reflects the fermionic properties of the constituting electrons. Here, B k0 ^ = bj/ represents creation 
operator for zero-momentum pairons. Bardeen, Cooper, and Schrieffer [23] studied the ground state 
of a superconductor, starting with the reduced Hamiltonian ^ 0 , which is obtained from the 

Hamiltonian ^ in 18.281 by retaining the zero-momentum pairons with q = 0 written in terms of b by 
letting B k0 ® = b k ®, that is, 

- E'E'^KW+ WW + W+*W f l. 

(8.43) Jt v L 

Here, we expressed the “electron” and “hole” kinetic energies in terms of pairon operators. The 
reduced Hamiltonian is bilinear in pairon operators (b, b f ), and can be diagonalized exactly. 
Bardeen, Cooper, and Schrieffer obtained the ground state energy E 0 as 

18.441 , 

where p(0) is the density of states at the Fermi energy. The w 0 is the ground state energy of the 

pairon, see 18.401 . Equation 18.441 means simply that the ground state energy equals the numbers of 
pairons times the ground state energy of the pairon. In the ground state of any system there is no 
current. To describe the supercurrent we must introduce moving pairons, that is, nonzero-momentum 
pairons. We first show that the center of masses of the pairons move as bosons. That is, the number 
operator of pairons having net momentum q 

= E = E 

(8.45) T T 
have the eigenvalues 

18.461 = ■ 

The number operator for the pairons in the state (k, q) is 

j- ]■ 

( 8 . 47 ) >lkq = = C k + ql2 C -k+qj2 C - t + l! lCk +lil ’ 

where we omitted the spin indices. Its eigenvalues are limited to zero or one: 

18.481 "&« = ° OTl - 

To explicitly see 18.461 . we introduce 


(8.49) 
and obtain 


a ,=E B >« 


(8.501 
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Although the occupation number is not connected with B q as n q ^ B q t B q , the eigenvalues n' q of n q 
satisfying 18.501 can be shown straightforwardly to yield [24] n' q = 0, 1,2, ... with the eigenstates 

(8.51)1°). |1) = B+|0) , |2) = fi+fljlO)..- . 

This is important. We illustrate it by taking a one-dimensional motion. The pairon occupation- 
number states may be represented by drawing quantum cells in the (k, q) space. From (8.48) . the 
number n' kq is limited to 0 or 1, see Figure 8.6 . The number of pairons characterized by the net 

momentum q only, n q , is the sum of the numbers of pairs at column q, and clearly it is zero or a 
positive integer. 

Figure 8.6 The number representation of many electron pairs in the (k, q) space. 

k 



In summary, pairons with both k and q specified are subject to the Pauli exclusion principle, see 
18.481 . Yet, the occupation numbers n' q of pairons having a CM momentum q are 0, 1,2, ... 

The most important signature of many bosons is the Bose-Einstein Condensation. Earlier we 
showed that the pairon moves with the linear dispersion relation, see 18.391 : 

2 

(8 52) Wp = + + ' 

where we designated the pairon net momentum (magnitude) by the more familiar p rather than q. 

Let us consider a 2D system of free bosons having a linear dispersion relation: s = cp, where c = 2/ 
7iv F . The number of bosons, N, and the Bose distribution function, 

(8.53) = e*l-‘) -1 = Mt) ( - 0)> 

are related by 

N = = N 0 + / B (c p ), 

18.541 P fp >° 

where p is the chemical potential and 

(8.55) No ^( e_ ' JJ<_1 ) 

is the number of zero-momentum bosons. Note that a = |3p with p = (k B T) -1 . The prime on the 

summation in 18.541 indicates the omission of the zero-momentum state. For notational convenience 
we write 






























(8.56) 
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We divide 18.541 by the normalization area L 2 , and take the bulk limit: 
(8.57) N —^ oo , L —^ oo while 
We then obtain 


NI -2 = n 


where n 0 = N 0 /L 2 is the number density of zero-momentum bosons and n the total boson density. After 
performing the angular integration and changing integration variables, we obtain from 18.58) 

OO 

2itft 2 c 2 /? 2 (n - n 0 ) - f dx .. _ 1 *. _ , x = fic . 

18.591 o 

Here X is the fugacity defined by 
(8.60) ^» (<!)■ 

We note that the fugacity X is less than unity for all temperatures. After expanding the integrand in 
18.591 in powers of A.e _x (< 1), and carrying out the x-integration, we obtain (Problem 8.5.1) 

„ =n _„ 

C8.6H * “ ° “ IntiW ' 

where <|> m (A,)is given by 


'Pin 

(8.62) 

We need <J) 2 (X,) here, but we introduced (|) m for later reference. Equation 18.611 gives a relation among 
X, n, and T. 

The function <J) 2 (X,) monotonically increases from zero to the maximum value c)) 2 (l) = 1.645 as X is 
raised from zero to one. In the low-temperature limit, X = 1, 4> 2 (A) = c|) 2 ( 1) = 1.645, and the density of 
excited bosons, n x , varies like T 2 as seen in 18.611 . This temperature behavior of n x persists as long 
as the rhs of 18.611 is smaller than n; the critical temperature T c occurs at n = k B 2 T c 2 ()) 2 (l)/27rh 2 c 2 . 
Solving this, we obtain 

68.634 Ifr = 1.954* on 1 ! 2 (= 1.24ft. 






k =1 


If the tenperature is raised beyond T c , the density of zero-momentum bosons, no, becomes 
vanishingly small, and the fugacity X can be determined from (Problem 8.5.2) 

_ _ T . r 

18.641 “ 2jtft2 C 2 ’ 

In summary, the fugacity X is equal to unity in the condensed region: T < T c , and it becomes smaller 
than unity for T > T c , where its value is determined from 18.641 . Formula 18.631 for the critical 
temperature T c is distinct from the famous BCS formula 


18.651 3.53^^ = 230, 

where A 0 is the zero-temperature electron energy gap in the weak coupling limit. The electron energy 
gap A(T) and the pairon ground state energy w 0 depends on the phonon-exchange coupling energy 




















parameter v 0 , which appears in the starting Hamiltonian ^ in (8.281 . The ground state energy w 0 is 
negative. Hence, this w 0 cannot be obtained by the perturbation theory, and hence the connection 
between w 0 and v 0 is very complicated. This makes it difficult to discuss the critical temperature T c 
based on the BCS formula (8.651 . 

Unlike the BCS formula, formula (8.631 is directly connected with the measurable quantities: the 
pairon density % and the Fermi speed v F . 

We emphasize here that both formulas (8.631 and (8.651 can be derived, starting with the 
Hamiltonian (8.281 and following statistical mechanical calculations, see more details in reference 
[25]. 

Problem 8.5.1. Verify (8.611 . 

Problem 8.5.2. Derive (8.641 . 

8.6 Superconductivity in Metallic SWNTs 

The pitch in a metallic SWNT contains an irrational number of hexagons and there is no lattice k- 
vector along the tube. The unrolled plane may extend along the tube axis (y-axis) and also along the 
circumference (x-axis) just like a graphene sheet. But there is a significant difference. The true 
graphene sheet has an inversion symmetry with respect to the plane. The rolled SWNT has an inside 
and outside. If “holes” are excited on the inside, then they can move along the tube length, providing 
an extra channel for the transport of charge. “Electrons,” which are negatively charged, cannot move 
in a straight line along the tube since they are attracted by the positively charged wall. We may 
assume a ID k-vector along the tube length. 

The contribution of the “hole” channel is significant. Transport experiments by Moriyama et al. [9] 
show that the main charge carriers are “holes.” In contrast, “electrons” are the normal charge carriers 
in graphene. Thermopower (Seebeck coefficient) measurements by Kong et al. [26] also provide 
evidence that the charge carriers are “hole”-like. 

Superconductivity occurs in a 2D (or 3D) crystal having “electrons” and “holes”. The unrolled 
configuration of a SWNT clearly indicates a possible 2D superconductor. Phonons and electrons 
share the same Brillouin zone. This affinity is important for the electron-phonon interaction. In fact 
phonon exchange binds electron pairs, called Cooper pairs (pairons). Since the phonon does not carry 
a charge, phonon exchange conserves the charge and can, and must, create (or annihilate) a pair of 
positive (+) and negative (-) pairons simultaneously. Hence, the numbers of ±pairons created are 
mutually equal. The -pairon, that is, the “electron” pair, moves with a higher speed 2hzv\ than the 

+pairon (“hole” pair) since 
(8.66) v> i > v 2 in graphene. 

Hence, there should be a supercurrent if the pairons undergo a BEC. In the superconducting state 
±pairons are condensed at the same momentum p n . The supercurrent density, calculated by (charge) x 

(density) x (speed), is 

fa 











where is the (-pairon) density. 

We have shown earlier in Section 8.2 that the “electron” and “hole” have different charge 
distributions and have therefore different effective masses (m l9 m 2 ). Then, the Fermi speeds (v 1? v 2 ) 

are different, and the supercurrent density is finite as seen in (8.67) . It is noteworthy that the 
supercurrent density j vanishes if we assume that the “hole” is an antielectron with the same mass. 
The fermion-antifermion symmetry must be broken for superconductivity to occur. 

The superconducting state becomes unstable if enough magnetic field is applied. This is so because 
± pairon motion (currents) is affected differently by the field. If the condensation (Meissner) energy is 
overcome by the magnetic field energy, the system reverts to the normal state. There is a critical field. 
As long as the binding energy is greater than the magnetic energy the system stays in a 
superconducting state by repelling the magnetic field, which is known as the Meissner effect. 

The pairon moves with a linear dispersion relation, see 18.411 . The BEC of the pairons occur in a 
2D graphene plane with the critical temperature T c , given by 18.631 . We note that no BEC (and no 

superconductivity) occurs in ID. 

8.7 High-Field Transport in Metallic SWNTs 

In 2000, Yao, Kane, and Dekker [27] reported high-field transport in metallic SWNT. In Figure 8.7 . 
we reproduced their I-V curves, after [27, Figure 1]. At low fields (voltage ~ 30 mV), the currents 
show temperature-dependent dips near the origin, exhibiting non-Ohmic behaviors while at high 
fields (~ 5 V) the resistance R versus the bias voltage V shows the relation: 

Figure 8.7 Typical current I and differential conductance dl/dV vs. voltage V obtained using (a) low- 
resistance contacts (LRC) and (b) high-resistance contacts (HRC). The inset in (b) plots dl/dV vs. V 
on a double-log scale for the HRC sample. 

After Yao et al [27], 
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(8.68) R = Ro + ^ (hlBhV) ' 

where R 0 and T 0 are constants. The authors discussed this low-field behavior in terms of a ID 

Luttinger liquid model. Many experiments, however, indicate that electrical transport in SWNTs has a 
two-dimensional (2D) character [6]. In fact, the conductivity in individual nanotubes depends on the 
circumference and the pitch characterizing a space curve (2D). Hence, nanotube physics requires a 
2D theory. In the present work, we present a unified microscopic theory of both low-and high-field 
conductivities. Carbon nanotubes were discovered by Iijima [6]. The important questions are how the 
electrons or other charged particles traverse the nanotubes and whether these particles are scattered 
by inpurities and phonons or not. To answer these questions, we need the electron energy band 
structures. Wigner and Seitz [28] developed the WS cell model to study the ground state of a metal. 
Starting with a given lattice, they obtain a Brillouin zone in the k-space and construct a Fermi surface. 
This method has been successful for cubic crystals including the face-centered cubic (fee), the body- 
centered cubic (bee), diamond (dia) and zincblende lattices. If we apply the WS cell model to 
graphene, we then obtain a gapless semiconductor, which is not experimentally observed [6]. 

SWNTs can be produced by rolling graphene sheets into circular cylinders of about 1 nm diameter 
and microns (urn) in length [1,2]. Electrical conduction in SWNTs depends on the circumference and 
pitch as shown in Figure 6.1 . Chapter 6, and can be classified into two groups: semiconducting or 
metallic [6]. In our previous work [29], we have shown that this division into two groups arises as 
follows. A SWNT is likely to have an integral number of carbon hexagons around the circumference. 
If each pitch contains an integral number of hexagons, then the system is periodic along the tube axis, 
and “holes” (not “electrons”) can move along the tube. Such a system is semiconducting and its 
electrical conductivity increases with temperature, and is characterized by an activation energy e 3 

[30]. The energy s 3 has a distribution since both pitch and circumference have distributions. The pitch 



















angle is not controlled in the fabrication process. There are numerous other cases where the pitch 
contains an irrational number of hexagons. In these cases, the system shows a metallic behavior which 
has been experimentally observed [9]. 

We primarily deal with metallic SWNTs in the present work. Before dealing with high-field 
transport, we briefly discuss low-field transport. Tans et al. [5] measured the electric currents in 
metallic SWNTs under bias and gate voltages. Their data from [5, Figure 2], are reproduced in Figure 
8.1 . where the currents versus the bias voltage are plotted at three gate voltages: A (88.2 mV), B 
(104.1 mV), and C (120.0 mV) tshown in Figure 8. I T 

Significant features are listed in Section 8.1. Tan et al. [5] interpreted the flat currents near V bias = 0 

in Figure 8.1 in terms of a ballistic electron model [6]. We proposed a different interpretation of the 
data in Figure 8.1 based on the Cooper pair [21] (pairon) carrier model. Pairons move as bosons, and 
hence they are produced with no activation energy factor. All features (a)-(e) listed in Section 8.1 
can be explained simply with the assumption that the nanotube wall is in a superconducting state (see 
Section 8.1). 

In the currently prevailing theory [6], it is argued that the electron (fermion) motion becomes 
ballistic at a certain quantum condition. But all fermions are known to be subject to scattering. It is 
difficult to justify why the ballistic electron is not scattered by inpurities and phonons, which 
naturally exist in nanotubes. Yao, Kane, and Dekker [27] emphasized the importance of phonon 
scattering effects in their analysis of their data in Figure 8.7 . The Cooper pairs [21] in supercurrents, 
as is known, can run with no resistance (due to inpurities and phonons). Clearly the experiments on 
the currents shown in Figure 8.7 are temperature-dependent, indicating the importance of the 
electron-phonon scattering effect If the ballistic electron model is adopted, then the phonon scattering 
cannot be discussed within the model’s framework. We must go beyond the ballistic electron model. 

If the SWNT is unrolled, then we have a graphene sheet, which can be superconducting at a finite 
temperature. We studied, in Section 6.2, the conduction behavior of graphene, starting with the 
honeycomb lattice and introducing “electrons” and “holes” based on the orthogonal unit cell. Phonons 
are generated based on the same orthogonal unit cell. In Sections 4.3 and 4.4, we treated phonons and 
phonon-exchange attraction. In Section 8.3, we constructed a Hamiltonian suitable for the formation 
of the Cooper pairs and derived the linear dispersion relation for the center of mass motion of the 
pairons (see Section 8.4). The pairons moving with a linear dispersion relation undergo a BEC in 2D 
(see Section 8.5). In the next section we discuss the zero-bias anomaly (ZBA) observed in Figure 8.7 . 

8.8 Zero-Bias Anomaly 

The unusual current dip at zero bias in Figure 8.7 is often called the zero-bias anomaly. This effect is 
clearly seen in the low-resistance contacts (LRC) sample. The differential conductance dl/dV 
increases with increasing bias, reaching a maximum at V ~ 100 mV With a further bias increase, 
dl/dV drops dramatically. See Figure 8.7a . We will show that the ZBA arises from a breakdown of 
the superconducting state of the system 

With no bias, the nanotube’s wall below ~ 100 K is in a superconducting state. If a small bias is 
applied, then the system is charged, positively or negatively depending on the polarity of the external 
bias. The applied bias field will not affect the neutral supercurrent but can accelerate the charges at 












the outer side of the carbon wall. The resulting normal currents carried by conduction electrons are 
scattered by inpurities and phonons. The phonon population changes with temperature, and hence the 
phonon scattering is temperature-dependent. The normal electric currents along the tube length 
generate circulating magnetic fields, which eventually destroy the supercurrent running in the wall at a 
high enough bias. Thus, the current I (pA) versus the voltage V (mV) is nonlinear near the origin 
because of the supercurrents running in the wall. The differential conductance dl/dV is very small and 
nearly constant (superconducting) for V < 10 mV in the high-resistance contacts (HRC) sample, see 
Figure 8.7b . We stress that if the ballistic electron model [6] is adopted, then the scattering by 
phonons cannot be discussed. The nonlinear I-V curves below 150 K mean that the carbon wall is 
superconducting. Thus, the clearly visible temperature effects for both LRC and HRC samples arise 
from the phonon scattering. We assumed that the system is superconducting below ~ 150 K. The ZBA 
arises only from the superconducting state. The superconducting critical temperature T c must then be 

higher than 150 K. An experimental check of T c is highly desirable. 

8.9 Temperature Behavior and Current 
Saturation 

Yao et al. [31] extend the I-V measurements up to 5 V as shown in Figure 8.8 . Strikingly, the I-V 
curves at great bias measured at different temperatures between 4 K and room temperature overlap 
each other. This temperature behavior is consistent with our picture that the superconductivity state of 
the metallic SWNT continued throughout the temperature range measured. Thus, the superconductivity 
temperature T c must be higher than room temperature. 

Figure 8.8 Large-bias I-V characteristics at different temperatures using low-resistance contacts for 
a sample with an electrode spacing of 1 pm The inset plots R = V/I vs. V. 

After Yao, et al. [31]. 
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From the shape of the I-V curves in Figure 8.8 . it is clear that the trend of decreasing conductances 
continues to high bias. Extrapolating the measured I-V curves to higher voltage would lead to a 













current saturation, that is, a vanishing conductance. The current saturation may arise as follows. When 
the bias is raised from zero, the system will be charged with “holes.” The resulting “hole” currents 
run along the outer side of the tube. There is an extra contribution to the current I. But these currents 
must saturate since the maximum “hole” states are limited by the Pauli exclusion principle. The 
maximum number of “holes” must be much smaller than the number of C’s. 

8.10 Summary 

Various non-Ohmic behaviors observed in metallic SWNTs under bias and gate voltages were 
discussed based on the bosonic pairon model in this chapter. The ballistic electron model was 
avoided since this model cannot explain by itself why the ballistic fermionic electrons are not 
scattered by inpurities and phonons. In our microscopic theory we start with a honeycomb lattice for 
graphene, establish a Fermi surface, identify the charge carriers, and discuss the electrical transport 
using kinetic theory. 

We have uncovered a number of significant facts: 

• “Electrons” and “holes” move with different masses in anisotropic environments in graphene 
(and graphite). In contrast “electrons” and “holes” in germanium (Ge) and silicon (Si) move in 
isotropic environments. 

• For electron dynamics it is important to introduce a rectangular (non-WS) unit cell for the 
honeycomb lattice. Only then can we establish that “electrons” and “holes” move with different 
effective masses (m 1? m 2 ) and different activation energies (e l5 e 2 ). Graphene is intrinsically 

anisotropic. 

• A metallic (semiconducting) SWNT contains an irrational (integral) number of carbon hexagons 
in each pitch. The former case occurs more often. 

• The unrolled configuration of the metallic SWNT is a 2D graphene sheet indefinitely extending 
over the tube length direction and the circumference direction. Electrical transport in the wall 
occurs in 2D (not in ID). 

• “Holes” (and not “electrons”) can move inside the tube, and this “hole” channel is significant as 
observed in the thermopower experiments. 

• Electrons and phonons move in the same k-space within the (first) Brillouin zone. This affinity 
makes the electron-phonon interaction important. 

• The phonon exchange between electrons binds Cooper pairs (pairons). Positively and negatively 
charged (±)pairons are created (or annihilated) in pairs simultaneously. The number of ±pairons 
are mutually equal at 0 K. 

• The pairons move with the linear dispersion relation. 

• The CM of the pairons move as bosons. 

• The system of the pairons moving in 2D with the linear dispersion relation undergoes a BEC 
with the critical temperature T c given in t8.63T 

• A superconducting state is generated by the BEC of ±pairons at the CM momentum p n = 27ihn/L. 

The integer n is small and the tube length L is macroscopic. 

• The supercurrent density is given by (8j67). The current is finite only if the “electron” and the 
“hole” have different masses. 




• The supercurrent is neutral, and hence it cannot be accelerated by a bias electric field. 

• The supercurrent is destroyed if a strong magnetic field is applied. There is a critical field. 

• A small magnetic field is repelled by the superconducting carbon tube. 

• The superconducting state is destroyed by raising the temperature. It is important to find the 
critical temperature T c above which the electrical conduction becomes normal (resistive). 
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Chapter 9 


Magnetic Susceptibility 


The electron has mass m, charge —e, and a half-spin. Hence, it has a spin magnetic moment. Pauli 
paramagnetism and Landau diamagnetism of the conduction electrons are discussed in this chapter. 


9.1 Magnetogyric Ratio 

Let us consider a classical electron’s motion in a circle in the x y-plane as shown in Figure 9.1 . The 
angular momentum 1 = r x p points towards the positive z-axis and its magnitude is given by 


Figure 9.1 An electron with a circular motion generates a magnetic moment p proportional to its 
angular momentum 1 = mr x v. 



(9.1) 1*1 = mrv ■ 

According to the electromagnetism, a current loop generates a magnetic moment p (vector) whose 
magnitude equals the current times the area of the loop, and whose direction is specified by the right- 
hand screw rule. The magnitude of the moment generated by the electron motion, therefore, is given 
by 


Current x Area = () x (jtr 2 ) = -evr r 
? it r / 7 


(9.2) \2htS ' ' ’ 2 

and the direction is along the negative z-axis. We observe here that magnetic moment p is 
proportional to the angular momentum 1. We may express this relation by 


(93} V = 

This relation, in fact, holds not only for this circular motion but also in general. The proportionality 
factor 


-e 

(9.4) a= 2^i 

is called the magnetogyric or magnetomechanical ratio. We note that the ratio is proportional to the 
charge —e and inversely proportional to the (electron) mass m. 









We assume that a magnetic field B is applied along the positive z-axis. The potential field energy y 
of a magnetic dipole is given by 

19.51 V = —fiB cos 8 = -fi z B , 
where 0 is the angle between the vectors p and B. 

We may expect that a general relation such as 19.31 holds also in quantum theory. The angular 
momentum (eigenvalues) is quantized in the units of h. The electron has a spin angular momentum s 
whose z-component can assume either l/2h or -l/2h. Let us write 

(9.6) 5 ‘ = T* = T'"' = < = ±l ' 

Analogous to (9.3) we assume that 

(9.7) 0< <X ff . 

We shall write this quantum relation in the form 

(9.81 ^ = 
where 

F f) 

(9 9 ) flB = ~ = 9-27 * 10_21 erg ' G_1 (CGS ™ its) ’ 

called the Bohr magneton,^ has the dimensions of the magnetic moment. The constant g in 19.8) is a 
numerical factor of order 1, and is called a g-factor. If the magnetic moment of the electron is 
accounted for by the “spinning” of the charge around the z-axis, then the g-factor should be exactly 
one. However, the experiments show that this factor is 2. This phenomenon is known as the spin 
anomaly, which is an i mportant indication of the quantum nature of the spin. 

In the presence of a magnetic field B the electron whose spin is directed along B, called the 
electron with up-spin, will have a lower energy than the down-spin electron whose spin is directed 
against B. The difference is, according to 19.51 and 19.81 . 

(9 10) Ae = ~ ■ 

For B = 7000 G and g = 2, we obtain the numerical estimate Ae/k B ~ 1 K. 

If an electromagnetic wave with the frequency v satisfying hv = As is applied, then the electron may 
absorb a photon of the energy hv and jump to the upper energy level. Figure 9.2 illustrates this 
phenomenon, which is known as the electron spin resonance (ESR). The frequency corresponding to 
As/kg = 1 K is 

Figure 9.2 Electron spin resonance (ESR). An electron with the up-spin may absorb a photon of the 
energy hv and jump to the upper energy level by flipping its spin if As = hv. 



-AA/V* 1 > Af = hv 

—f— J 

(9.11) v — 2-02 x 10 10 cycles s -t . 

This frequency falls in the microwave region of the electromagnetic radiation spectrum 













9.2 Pauli Paramagnetism 


Let us consider an electron moving in free space. The quantum states for the electron can be 
characterized by momentum p and spin o(= ±1). If a weak constant magnetic field B is applied along 
the positive z-axis, then the energy e associated with the quantum state (p, a) is given by 

where the second term arises from the electromagnetic interaction (see 19.51 and 19.81 1. Since g = 2 
for the electron spin, we may simplify 19.121 to 

(9.13) f = ' 

This expression shows that the electron with up-spin (a = +1) has a lower energy than the electron 
with down-spin (a = -1). We say that the spin degeneracy is removed in the presence of a magnetic 
field. 

Let us now consider a collection of free electrons in equilibrium At the absolute zero temperature, 
the states with the lowest energies will be occupied by the electrons, the Fermi energy s F providing 

the upper limit. This situation is schematically shown in Figure 9.3 . where the density of states, v + (s) 
and £>-(£), for electrons with up- and down-spins are drawn against the energy 8. The density of states 
for free electrons was discussed in Section 3.6. In the absence of the field, both D + (e) and d_( 8 ) are 
the same and are given by one-half of 13.631 : 


Figure 9.3 The density of states v + and p_, for the free electrons with up (f) and down (j) spins, are 

drawn against the energy e, which is measured upwards, (a) When B = 0 the two halves of the Fermi- 
Dirac distribution are equal, and thus M = 0; (b) when a field B is applied, spins in the antiparallel 
half flip into the parallel half, resulting in a net parallel magnetization. 

B= 0 
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Because of the magnetic energy -p B B, the curve for the density of states, £i + (e), for electrons with 
up-spins will be displaced downward by p B B compared with that for zero field and will be given by 


(9.15) 
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m 3 / 2 

V2jt 2 A 3 


+ l *B B5) , 


£ > —■ 





















Similarly the curve for the density of states, vX £ )> for electrons with down-spins is displaced by 
(9.16) ® -W = = ®" (e '~^ B) ’ ' - • 

From Figure 9.3 . the numbers N ± of the electrons with up- and down-spins are given by 


£ F 




N ± =V f deV ± (E) = V f dxVo(x) (x = e±^B) . 

(9.17) TubB o 

The difference N + - N. generates a finite magnetic moment for the system. Each electron with up-spin 
contributes p B , and each electron with down-spin contributes -p B . Therefore, the total magnetic 
moment is N + p B - N.^ . Dividing this by volume V, we obtain, for the magnetization, 

i = ^[N + -N_l 


C p +.id J1 E 


fp— 


= ^b| f dx23o[x]- / djc!?o{*) I — 2 /<bBI?o(ef) , 

(9.18) V ° 0 / 

where we retain the term proportional to B only. Using (9T4), we can re-express this as follows: 

_ y2ft ]S ffl 1 yZg q 

(9.19) jtW F 

The last expression shows that the magnetization is positive, and proportional to the field B. That is, 
the system is paramagnetic. The susceptibility % defined through the relation 


(9.20) l 
is given by 

jj2 

(921} ep ■ 

By using the relation 

2 -Jim-* 1 lt2 
(9.22) H f 

we can rewrite f9.2D as (Problem 9.2.1) 

3 

(925) = 2 “' 

This result was first obtained by Pauli [1] and is often referred to as Pauli paramagnetism. We can 
easily extend the theory to a finite temperature case (Problem 9.2.2). 

We note that Pauli paramagnetism is weaker than the paramagnetism of isolated atoms 
approximately by the factor k B T/s F (if this factor is small). 

Problem 9.2.1. Derive f9.23V 

Problem 9.2.2. Derive the magnetization at finite temperatures. Hint: The magnetization at a finite 
temperature is given by 

OO 

l - -.MOt) - (”;)) - «b j dcI>(c)[/ F (jE — B) — JFf{£-Mb5)] . 

o 

where f F is the Fermi distribution function and d(s) is the density of states in energy space. 













9.3 The Landau States and Levels 

A classical electron circulates around an applied magnetic field. This is often called cyclotron 
motion. The important quantum effect is the quantization of the cyclotron motion. Let us calculate the 
energy levels of an electron in a constant magnetic field B(= V x A). We choose the vector potential 
(9.24) ^ A y, A z ) (0, Bx , 0) , 

which yields a constant field B in the z-direction (Problem 9.3.1). The Hamiltonian H is then given by 
(Problem 9.3.2) 

ms h = t^ 1p+ £j4|!=1 W + +eBx]!+ & ■ 

By replacing the classical momentum p with the corresponding momentum operator -ihV in (9.25) . 
the Schrodinger equation can now be written down as 


(926) 


h 2 

r a 2 

f 3 

i tB \ 

2 a 2 " 

2 m 
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+-*) 

+ aF 


p — . 


Since the Hamiltonian ^ contains neither y nor z explicitly, we assume a wavefunction of the form: 
(9.27) V( X ’Y' Z ) = *(*)«P[-U*yy + . 

Substituting this expression into (9.26) yields the following equation for (j)(x) (Problem 9.3.3): 


(928) L 


2m dx 2 


1 2 { hk y \ 

+ 2 m,al { X -7B) 


<p(x) = £i<£(*:), 


2r,2 


h 2 k 

(929) £1 = F “ 2m ' 

Equation (9.281 is the energy-eigenvalue equation for a harmonic oscillator with the cyclotron 
frequency 

tB 

(9.30) = 3iT 

and the center of oscillation displaced from the origin by Xq = h ky/(eB). The energy eigenvalues are 
given by 


£1 = 


(» L + i), 


N L = 0,1,2,... 


an) 

Combining this with (929), we obtain 


£ = £ 




N l = 0,1,2,... 


/ „ 1 \ * £ kj 

— fito,- i ftf[_ — I -+■ 

(9.32) . V 2} 2m 

These energy eigenvalues are called the Landau Levels (LL) [2], The corresponding quantum states, 
called the Landau states, are characterized by the quantum number (N L , ky, kj. We note that the 

energies do not depend on k y , and they are therefore highly degenerate. The Landau states are quite 

different from the momentum eigenstates. This has significant consequences on magnetization and 
galvanomagnetic phenomena. An electron in a Landau state may be pictured as circulating with the 
angular frequency ro c around the magnetic field. If radiation having a frequency equal to ro c is applied, 

the electron may j ump up from one Landau state to another by absorption of photon energy equal to 
hco c . This generates the phenomenon of cyclotron resonance. 



















Problem 9.3.1. Show that the vector potential given by 19.24) generates a magnetic field pointing in 
the positive z-direction. For a constant magnetic field B we can choose the vector potential A = \ B x 
r. Show this by explicitly calculating V x A. 

Problem 9.3.2. Derive the Hamiltonian 19.25) . 

Problem 9.3.3. Derive 19.28) from 19.26) and 19.27) . Solving 19.28) . obtain its energy eigenvalues 

(Ml). 


9.4 Landau Diamagnetism 


The electron always circulates around the magnetic flux so as to reduce the magnetic field. This is 
called the motional diamagnetism If we calculate this effect classically by considering the system 
confined to a closed volume, we obtain zero magnetic moments. This is known as van Leeuwen’s 
theorem We first demonstrate this theorem 

Let us take a system of free electrons confined in a volume V The partition function per electron is 
1 f ,2 f ,i r (p 4" 




(9.33) v 
We introduce kinetic momentum II: 

(9.34) wr = mv = p + cA = FI = {IT X , n yt H z ) . 

After simple calculations, we obtain 

(9 35) dxdydzdp^dp^dpz — dxdydzd/7 x d/7 y d/J z . 

Using the last three equations, we can show that Z(B) is equal to the electron partition function with 
no field; 

(9.36) Z[B) = Z[B = 0 ). 

In 1930, L.D. Landau [2] showed that quantization of electron circulation yields a diamagnetic 
moment. We shall demonstrate this below. 

Electrons obey Fermi-Dirac statistics. Considering a system of free electrons, we define the free 
energy F as (Problems 9.4.1 and 9.4.2) 

F = - 2t B r V In [l + , 

19.371 T L J 

where the factor 2 arises from the spin degeneracy and Sj is the Landau energy given by 19.32) . The 

chemical potential p is determined from the condition: 

ap 

19.38) ~ 

The total magnetic moment M for the system can be found from 

M _J± 

19.39) W “ 3B ' 

Equation 19.38) is equivalent to the usual condition (Problem 9.4.3) that the total number of electrons, 
N, can be obtained in terms of the Fermi distribution 
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N = 2E/<*‘1- 

(md r 

The Landau energy e i is characterized by the Landau oscillator quantum number N L and the z- 
component momentum p z (= hk^. The energy E becomes continuous in the bulk limit. Let us introduce 
the density of states xj(g) (= dyy/de) such that 

!>{c )dr = the number of states having 
(9.42) an energy between r and e + dr . 

We now write ( 9.37 ) in the form 

OO 

F= Nu- 2 k,j I / dr In [l + . 

(9A$} 0 dE 

The statistical weight (number) w is the total number of states having energy less than 

E — ft(o c (N] + ^ ) 

(9.44) V - 2 ) 

Conversely, the allowed values of p z are distributed over the range in which |p x | does not exceed 
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2m ‘ 


jlm \ e - (n l + tW c 1! / ■ 

(9.45)1 L V 2 ) Jl 

For a fixed pair (s, N L ) the increment in the weight, dyy, is given by 


dvv = eB 


I1I2 / , r.j 
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= eB- 
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(9A6) i 2 ^) 1 

where V = L^Lj is the volume of the container. After summing (9A6) with respect to N L , we obtain 
W(e) = A { t£L 1 £ - (2N l + 1)11 , 

19 . 47 ) VIji k ,_„ 

where 


A = V 


(iTim) 2 ! 1 


E* = 


2jt r 


(9.48) i 2 ^) 1 1 ' hue 

We assume high Fermi degeneracy such that 
t9.49^ V — f F » for a metal. 

The sum over N L in t9.471 converges slowly. We use Poisson’s summation formula [3, 4] and, after 
mathematical manipulations, obtain 

(9.50) Wfr) = Wo 4- Wi + HW, 
where 

= A^-e^ , 

(9.51) 3 v jt 

yy, = ,, 1 

(9.52) 2^ ^ ' 
























( 9 , 53 ) 
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The detailed steps leading to (9.501 through (9.531 are given in the Appendix A.5. The term wq, 

which is independent of B, gives the weight equal to that for a free electron system with no field. The 
term w L is negative (diamagnetic) and can generate a diamagnetic moment. We start with (9.431 . 

integrate by parts, and obtain 


F = Np - 2 j ds W(e)/(e} 

0 

oc € 

= Nfi + 2 jj dc'Wfr') . 


(9.541 o o 

The -df/de, which can be expressed as (Problem 9.4.2) 
d/ 1 


—— = ———sedv 


(if ■ 


(9.55) de 4fe n r 
has a peak near s = p if k B T « p, and 

OQ 

(9.56) / d ‘ T "~ 1 ' 

For a smoothly changing integrand in the last member of (9.541 -d fids can be regarded as a Dirac 
delta function: 


(9.57) d £ “ 


v) ■ 


Using this property and (9.541 and (9.391 . we obtain (Problem 9.4.4) 

Fi — A — = {fta» c ) 2 Fj / 2 . 

(9.581 <>^T ' F 

Here we set p = s F . This is justified since the corrections to p(B, T) start with the B 4 term and with a 
T 2 term Using (9.581 and (9.491 . we obtain 


! 3M l t 3 2 F l 

59 ^ * Ulldau 3B 3B* 
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where n is the free electron density. 

Comparing this result with (9.231 . we observe that Landau diamagnetism is one third of the Pauli 
paramagnetism in magnitude: XLandau = U-Xpauli- But the calculations in this section are done with the 

assumption of the free electron model. If the effective mass (m*) approximation is used, formula 
(9.591 is corrected by the factor (m*/m) 2 as we can see from (9.581 . Hence, the diamagnetic 
susceptibility for a metal is 


imtaL 

Am [ jndau 


-(f ) 1 


X [indau 


(9.60) 

We note that the Landau susceptibility is spin-independent. For a metal having a small effective mass, 
the Landau susceptibility ^ L metal can be greater in magnitude than the Pauli paramagnetic 
susceptibility % Pauli . Then the total magnetic susceptibility expressed as 


























(9.61) X = £EWi + Xlandiu 

can be negative (diamagnetic). This is observed for GaAs (m* = 0.07m). 

The oscillatory term w osc in 19.531 yields the de Haas-van Alphen oscillations, which will be 
discussed in Section 10.1. 

Problem 9.4.1. The grand partition function E is defined by 

E =TR{exp(aN-£H)}= e aN Tr iJV {exp(c(N - , 

K=0 

where H N is the Hamiltonian of the N-particle system and Tr stands for the trace (diagonal sum). 

The symbol TR means a grand ensemble trace. We assume that the internal energy E, the number 

density n, and the entropy S are respectively given by 

„ , TM _ TR{Hexp(aJV - 
t - {tii -—-. 

(N) _ TR{Nexp(cfN-/?H]l 
” ~~ ~ ~ VE ' 

S = fegfln Z + fiE — a (N)] . 

Show that 

F = E — TS-p(N) - t H Tln 3 , 

where p = k B Ta is Gibbs free energy per particle: G = E - TS + PV = p{N}. 

Problem 9.4.2. 

1. Evaluate the grand partition function S for a free electron system characterized by H = Zj=i N 

Pj 2 /(2m). 

2. Show that 


W=^UlE = 2£/( tf ), E = -^lnZ = 2'£s,ft Cp ), 

P P 

where 2 is the spin degeneracy factor, 8p and f(s) are respectively given by 


= m =[ ^ r] ~ ir ‘ 


fl = Sr- 


Problem 9.4.3. Using the free energy F in 19.371 . obtain 19.401 and 19.411 . 
Problem 9.4.4. Verify 19.581 . 
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Chapter 10 


Magnetic Oscillations 


The de Haas-van Alphen oscillations in susceptibility are often analyzed using Onsager’s formula, 
which is derived. The statistical mechanical theory of the oscillations for the quasifree electron 
moving in 3D and 2D is developed in this chapter. 

10.1 Onsager’s Formula 

A metal is a system in which the conduction electrons, either “electrons” or “holes,” move without 
much resistance. “Electrons” (“holes”) are fermions having negative (positive) charge q = - (+)e and 
positive effective masses m*, which respond to the Lorentz force F = q(E + v x B). These conduction 
electrons are generated in a metal, depending on the curvature sign of the metal’s Fermi surface. Their 
existence can be checked by the linear heat capacity at the lowest temperatures. As we saw in 
Chapter 3, each metal has a Fermi surface, often quite complicated. 

The most frequently used means to probe the Fermi surface is to observe de Haas-van Alphen 
(dHvA) oscillations and analyze the data using Onsager’s formula (10.2) . A magnetic field of the 
order 1 T = 10 4 G is applied in a special lattice direction. The experiments are normally carried out 
using pure samples at the liquid helium temperatures to reduce the inpurity and phonon scattering 
effects. 

The magnetic susceptibility % is defined by 

( 10 . 1 ) I = xB , 

where I is the magnetization (magnetic moment per unit volume) and B the applied magnetic field 
(magnitude). When the experiments are done on pure samples and at very low temperatures, the 
susceptibility % in some metals is found to exhibit oscillations with varying magnetic field strength B. 
This phenomenon was first discovered in 1930 by de Haas and van Alphen [1, 2] in their study of 
bismuth (Bi), and it is called de Haas-van Alphen oscillations. Magnetic oscillations in 
magnetoresistance (MR) similar to the dHvA oscillations are called Shubnikov-de Haas (SdH) 
oscillations, which is discussed in Section 10.5. As we shall show, these oscillations have a quantum 
mechanical origin. Currently, analyses of dHvA oscillations are done routinely in terms of Onsager’s 
formula (10.2) . According to Onsager’s theory [3], the nth maximum (counted from 1/B = 0) occurs 
for a field B given by the relation 

1 A 1 , A 

W -L y — __ = _ ip.-, _ 

f IQ 2xhe B (2jtA) 2 B ’ 

where A is any extremal area of intersections between the Fermi surface and the family of planes B • 
p = B • (hk) = constant, and y is a phase (number) less than unity. The constant 
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is called the electron flux quantum. 

As an example, consider an ellipsoidal Fermi surface, as shown in Figure 10.1 a : 


Figure 10.1 Fermi surfaces: (a) ellipsoid, (b) hyperboloid (neck), (c) cylinder. 



The subscript “F” on e will be omitted hereafter in this section. Assume that the field B is applied 
along the p 3 -axis. All the intersections are ellipses represented by 

e - ^ = constant). 

(10.5) 2m i 2m t 2m 2 ^ 

The maximum area A of the intersection occurs at p 3 = 0 (the “belly”; see Figure 10. la ), and its area 
A is 

H0.6) A= Jt(2m 1 e) 1/2 (2m 2 e) 1/2 = 2jr(mim 2 ) 1 ^e . 

Using (10.2) and solving for s, we obtain 

(10.7) E = <5B(mim 2 ) -1/2 (n + y)* - 

which indicates that the energy 8 is quantized as the energy of the simple harmonic oscillator with the 
angular frequency 
eB 

(10.8) = (f»im 2 )V2 ■ 

As a second example, we take a hyperboloidal Fermi surface that can be represented by (10.4) with 
m 1? m 2 > 0 and m 3 < 0. Assume the same orientation of B. Equations (10.51 and (TO.6) then hold, 

where the area A represents the minimal area of the intersection at p 3 = 0 (the “neck”; see Figure 
10.1b ). As a third example, assume that m 3 = <x> in (10.4k which represents a Fermi cylinder (see 
Figure 10.1c ). In this case the area A is given by (10.61 for the same orientation of B. 

All three geometrical shapes are discussed by Onsager in his correspondence [3]. At the time of 
writing in 1952, only the ellipsoidal Fermi surface was known in experiments. Today we know that 
all three cases occur in reality. When tested by experiments, the agreements between theory and 
experiment are excellent. The cases of ellipsoidal and hyperboloidal surfaces were found in noble 
metals such as copper (Cu), silver (Ag), and gold (Au). The dHvA oscillations in Ag are shown in 
Figure 10.2 . where the susceptibility % is plotted against B 1 in arbitrary units, after Schonberg and 
Gold [4, 5]. The magnetic field is along a (111) direction. The two distinct periods are due to the 
“neck’ and “belly” orbits indicated, the high-frequency oscillation coming from the larger belly orbit. 
By counting the number of high-frequency periods in a single low-frequency period, for example, 

































between the two arrows, we can deduce directly that A m (belly)/A m (neck) = 51, which is most 
remarkable. 


Figure 10.2 The dHvA oscillations in silver with the magnetic field along a {111) direction, after 
Schonberg and Gold [4, 5]. The two distinct periods are due to the neck and belly orbits indicated in 
the inset. 



Onsager’s derivation of (40.21 in his original paper [3] is quite illuminating. Let us follow his 
arguments. For any closed k-orbit, there should be a closed orbit in the position space, called a 
closed r-orbit. The periodic component of the motion, which involves the components of p = hk and r 
perpendicular to B, is quantized. We apply the Bohr-Sommerfeld quantization rule 

to the r-orbit. The magnetic moment p is proportional to the angular momentum j: 

(10.10) ^ = «j , « = constant. 

The cross section Q of the r-orbit is determined such that the enclosed magnetic flux ®, given by B O, 
equals (n + y) times the flux quantum ® 0 = h/e: 

( 10 . 11 ) ^ = (» + r)®* = ■ 

This is Onsager’s magnetic flux quantization. 

For a free electron the closed circular path in the k-space perpendicular to the field becomes a 
similar path in the r-space, turned through a right angle, and with the linear dimension changed in the 
ratio (eB) -1 (Problem 10.1.1). This may hold for a nearly circular closed orbit. If we assume this 

relation for the Bloch electron, the area enclosed by the closed k-orbit, A, is proportional to that 

enclosed by the closed r-orbit, O: 

(10.12) A = ■ 

Combining the last two equations, we obtain (10.21 . 






























The most remarkable argument advanced by Onsager is that an electron in a closed r-orbit may 
move, keeping a finite number n of flux quanta, each carrying O 0 = h/e, within the orbit. This comes 

from the physical principle that the magnetic field B does not work on the electron and, therefore, the 
field does not change the electron’s kinetic energy. This property should hold for any charged 
particle. The flux quantization for the Cooper pair in a superconductor was observed in 1961 by 
Deaver and Fairbank [6] and Doll and Nabauer [7]. Because the Cooper pair has charge (magnitude) 
2e, the observed flux quantum is found to be - h/(2 e), that is, half the electron flux quantum ® 0 

defined in 110.31 . The phase y in 110.91 can be set equal to 1/2. This can be deduced by taking the 
case of a free electron for which quantum calculations are carried out exactly. The quantum number n 
can arbitrarily be large. Hence, Onsager’s formula can be applied for any strength of field (Problem 
10 . 1 . 1 ). 

However, 110.21 turns out to contain a limitation. The curvatures along the closed k-orbit must 
either be entirely positive or negative. The k-orbit cannot have a mixture of a positive-curvature 
section and a negative-curvature section. 

Problem 10.1.1. Consider a free electron having mass m and charge q, subject to a constant magnetic 
field B. 

1. Write down Newton’s equation of motion. 

2. Show thatthe magnetic force qv x B does not work on the electron; that is, the kinetic energy E 
= mv 2 /2 does not change with time. 

3. Show that the component of v parallel to B is a constant. 

4. Show that the electron spirals about the field B with the angular frequency co = eB/m 

5. Show that the orbit projected on a plane perpendicular to the field B is a circle of radius R = p 
± /co = mv^/eB, where v represents the speed of the circular motion. Find the maximum radius 

lv max* 

6. Define the kinetic momentum n = mv and express the energy s in terms of Hj. 

7. Choose the x 3 (=z)-axis along B. Show that the curve represented by E(p 1? p 2 , 0) = s is a circle 
of radius (2 me) 1/2 = n. 

8. Show that the areas of the circles obtained in parts 5 and 7 differ by the factor (eB) 2 . 


10.2 Statistical Mechanical Calculations: 3D 


Susceptibility is an equilibrium property and, therefore, can be calculated by using standard 
statistical mechanics. Here, we demonstrate Onsager’s formula 110.2^) using a free electron model. 

The free energy F is, from 19.43k 


F =N/i -2 fe B T / dc ^ In f 1 — e^-'Wa’n] . 
(10.13) o di ' 

The oscillatory statistical weight w osc is, from 19.53k 
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We note that w osc oscillates with alternating signs. In fact the relevant energy e is of the order of the 
Fermi energy s F , which is much greater than the cyclotron frequency co c times the Planck constant h. 
Hence, if there are many oscillations within the width of df/de of the order k B T, then the contribution 
to F must vanish. This condition is shown in Figure 10.3 . Let us study this behavior in detail. Using 
(9 i 54), we obtain 

Figure 10.3 Numerous oscillations in W osc within the width of-df/ds cancel out the contribution to 
the free energy F. 

f(e) -df/de 



F — Nfi 4 - 2 

(10T5) 

The critical temperature T c below which the oscillations can be observed is 
(10.16) fc B r c ~^ c . 

Below this critical temperature (T < T c ), we cannot replace -dfrds by 6(s - p) since the integrand 
varies violently. The integral in (10.15) . 

OO c OO 

I l de'Woscfr') = f dcVVo K {E)/{E) , 

HO. 171 o o o 

must be calculated with care. We introduce a new variable £ = p(e - p) and extend the lower limit to 
-cg(Pp —► oo) and obtain 
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Using sin(A + B) = sin A cos B + cos A sin B and 

OO 

/ dte iaC _!___-_ 

mm -l eC+1 isinh,Ta ' 

(Problem 10.2.1), we obtain from U 0.151 (Problem 10.2.2) 

p = 4 (l t , {-vrcayxvetithuj-m 

(T0.20) V jt c R ^ v 1 ! 1 sinh(2jr 2 vfeBT/(ftto c )) 

Although (10.20) contains an infinite sum with respect to v just as the infinite sum in (9*47), its 
summation character is quite different. Only the first term with v = 1 in the sum is important in 
practice because (sinh(27r 2 vk B T/(hco c ))) -1 « 1. Thus, we obtain 
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Using this equation we calculate the magnetization I = - V _1 <3F/SB and obtain 

kftT /ft&» r \ 1;2 cos(2jTe F /(ft(i> (; ) — a/4) 


(1 0.221 
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sinh(2n; 2 fc B T/(fta> c )) 

The neglected terms are exponentially smaller than those in (10.22) since exp(k B T/(hco c )) » 1. In the 
low field limit, the oscillation number in the range k B T becomes great, and hence, the contribution of 

the sinusoidal oscillations to the free energy must cancel out. This effect is represented by the factor 
ak [5 r 

sinh(2jr 2 K B r/(fto e )) ' 

We define the susceptibility % by 

_J_ 

0023) X ~1 ' 

Note that the magnetization I is not necessarily proportional to the field B. Using Equations 19.231 . 
(9.581 . (10.221 . and (10.231 . we obtain (Problem 10.2.3) 
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Our calculations indicate that 

(a) The oscillation period is e F /(hco c ). This result confirms Onsager’s formula (10.21 . In fact the 
maximum area of 7ip F 2 occurs at p z = 0. Hence, 

1 2 Tift 7T pf. CF 

(2:rft} 2 “ B~ = tuT c 7 

if the quadratic dispersion relation 8 = p 2 /(2 m*) holds. We note that all electrons participate 
in the cyclotronic motion with the same frequency ro c , and the signal is substantial. 

(b) The envelope of the oscillations exponentially decreases in B -1 as 

- 2 :r 2 fc B r\ / (5 \ 2 ^ 2 ki ) rm S: ft 


exp 


O^W-i • - 


(10.26) * V ^ 

Thus, if the “decay rate” 5 in B -1 is measured carefully, the effective mass m* may be obtained 
directly through 

* 

HI = - 

(10.27) ' ■ 

The calculations in this section were carried out by assuming a quasifree electron model. The 
actual physical condition in solids is more complicated. We cannot use the quasifree particle model 
alone to explain the experimental data. 

Problem 10.2.1. Verify (10.191 . Hint: Consider an integral on the real axis 
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e* + 1 


z — x + iy , a, J? > 0 . 























We add an integral over a semicircle of the radius R in the upper z-plane to form an integral over a 
closed contour. We then take the limit as R —> go. Note that the integral over the semicircle vanishes in 
this limit if a > 0. The integral on the real axis, I(a, oo), becomes the desired integral in (10.19) . 
Evaluate the integral over the closed contour by using the residue theorem. 

Problem 10.2.2. Derive (10.20) . 

Problem 10.2.3. Verify (10.24) . 


10.3 Statistical Mechanical Calculations: 2D 


The dHvA oscillations occur in 2D and 3D. The 2D system is intrinsically paramagnetic since 
Landau’s diamagnetism is absent, which is shown here. 

Let us take a dilute system of quasifree electrons moving in a plane. Applying a magnetic field B 
perpendicular to the plane, each electron will be in the Landau state with the energy 


{ 1 \ zB 

e = hate. [ N] + — 1 , (o c = —— > Ni_ = 0,1, 2,... , 

(10.28) V m* 

where m* is the cyclotron effective mass. We introduce kinetic momenta: 

(10.29) ^X = Px + ZA. X , /7ji = Py -\- tAy . 


The Hamiltonian for the quasifree electron is then 

aom n= 2^( n ' +n *) = i^ nl - 

After simple calculations, we obtain 
(10.31) dxd/7*dyd/7 y = djfdp,dydpy . 

We can then represent quantum states by the small quasiphase space elements dxdn x dydn y . The 

Hamiltonian ^ in n0.30f does not depend on the position (x, y). Assuming large normalization 
lengths (1^, Ly), we can represent the Landau states by the concentric shells having the statistical 

weight 


(10.32) 
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with A = l^Ly and hco c = A(n 2 /(2m*)) = UAU/m in the n x n y -space as shown in Figure 10.4 . As the 
field B is raised the separation hco c grows, and the quantum states are collected or bunched together. 

As a result of the bunching, the density of states, v(z), should change periodically since the Landau 
levels (LLs) are equally spaced. The statistical weight w is the total number of states having energies 
less than 


Figure 10.4 Quantization scheme for free electrons: without magnetic field (dots) and in a magnetic 
field (circles) perpendicular to the paper. 











From Figure 10.4 . this w is given by 


(10.33) '—' «L= 

where 0(x) is the Heaviside step function: 

1 ifx>0 
0 if* < 0. 


©(*) = 


(10.34) 

We introduce the dimensionless variable s = 27 ie/(fr c ), and rewrite w as 


W(e) = 2C(ft<u £ ) E &[e* - (2N L + l)Jir], C = 2nm* A(2nA) _2 . 
(10.35f * L =o 

We assume a high Fermi degeneracy such that 


(10.36) f* - e F » ft<o c . 

The sum in (10.35f can be computed by using Poisson’s summation formula [8]: 

OQ i OO 9? 

E /< 2 ™>=2T E / dr /( r > e_iIUT ■ 

(10.37) *=-« 

After the mathematical steps detailed in Appendix A.5, we obtain [9] 

(10.38f W(£) = Hb + >V MC . 


(10.39) 


W 0 =O„ c (^) = a(^ i ) I . 


Wt . c= C* w 2 ftill si n(^) 


no.40f 

The B-independent term W 0 is the statistical weight for the system with no fields. The oscillatory 
term w osc contains an infinite sum with respect to v, but only the first term v = 1 is important in 
practice. The term w osc can generate magnetic oscillations. There is no term proportional to B 2 
generating the Landau diamagnetism 

We calculate the free energy F in f 10.154 using the statistical weight w in f 10.38f through (10.40) . 
and obtain (Problem 10.3.1) 
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where we used the integration formulas in (10.20) . We took the low-temperature limit except for the 
oscillatory terms. The magnetization I, the total magnetic moment per unit area, can be obtained from 


0042) a 3B 

Thus far, we did not consider the Pauli magnetization I Pauli due to the electron spin (see (9J_8)): 


I 2fi * M t V 

M0.43) = -~ ° (CF) = 2B '“~ ■ 

Using (10.41) through 110.431 . we obtain the total magnetization: 

^ total ^ Pauli 3“ i 

j»bB L _ / ep ) kuT / m \ cos(2a:ri>/(A(o c )) 
fl 0.441 n,Ul1 cf [ \is£E / £ F \m* 2 slnh(27c 2 m*fe B 

In this calculation, we neglected the spurious contribution of the B-derivatives of the quantities inside 
the cos(2tt s F /(h c )). This condition is absent when we calculate the magnetization I through 110.421 

directly. The magnetic susceptibility % is defined by the ratio 
(10.45) x ~1' 

Only the first oscillatory term, v = 1, is important and kept in 110.441 since sinh(27i 2 m* k B T/(heB)) » 
1. The negative sign indicates a diamagnetic nature. 

Figure 10.4 clearly shows that all electrons, not just those excited electrons near the Fermi surface, 
are subject to the magnetic field and all are in the Landau states. This is reflected by the fact that the 
Pauli magnetization I Pau]i is proportional to the electron density n, as seen in 110.431 . The oscillatory 

magnetization is also proportional to the density n. At a finite temperature T, -df/ds has a width. In 
this range of the order of k B T many oscillations occur if the field B is lowered. Assuming this 

condition, we obtained 110.441 . and hence, this equation is valid for any finite T. At T = 0, the width 
vanishes and the oscillatory terms also vanish. 

When the system is subjected to an external electric field, all electrons will respond, and hence, the 
magnetoconductivity a should be proportional to the electron density n. The oscillatory statistical 
weight generates a SdH oscillation [1,2] with the factor smaller by the factor k B T/b f in a. Hence, the 

dHvA and SdH oscillations should be similar. 

Stormer et al. [9] measured the SdH oscillations at 1.5 K in GaAs/AlGaAs, by stacking 4000 layers 
equivalent to the area of 240 cm 2 . (Without stacking the signal is too small to observe.) Their data are 
shown in Figure 10.5 . We see here that 


Figure 10.5 Experimental results on a stack of4000 layers of 2D electron systems equivalent to an 
area of240 cm 2 after Stormer et al. [9]. p is the magnetic moment, and % is the susceptibility. The 
trace denoted SdH is Shubnikov-de Haas data on a separate specimen of the same sample. 




















Problem 10.3.1. Verify (10.41). 



(a) the oscillation periods s F /hco c match, 

(b) the magnetization (— I) rather than the susceptibility % behaves more similarly to the diagonal 
resistance p xx , 

(c) the central line of the oscillation (background) is roughly independent of the field, and 

(d) the envelopes for -I and p^ are similar. 

Feature (a) means that both oscillations arise from the same cause, the periodic oscillation of the 
statistical weight yy. Feature (b) simply comes from the same field dependence (the B-independence) 
of the background (central line) of the oscillations. The density of states for a 2D quasifree electron 
system with no field is independent of the energy. The behavior (c) means that GaAs/AlGaAs is 
described adequately by this quasifree electron model. Feature (d) requires further discussion. Our 
formula fl 0.441 indicates that the period of the oscillations, s F /(hco c ) = m s F (heB), and the 

exponential decay rate, 7ik B T/(hoo c ) = 7 ik B Tm* /(he), of the envelope (sinh(27i 2 k B Tm*/(heB))) _1 are 

both controlled by the effective mass m*. This feature is found not to be supported by the observed 
experiments. We shall discuss this point later in Chapter 11. 

In conclusion, the 2D quasifree electron system is intrinsically paramagnetic, since there is no 













Landau diamagnetism. However, there are magnetic oscillations. The GaAs/AlGaAs heterostructure 
is often used for the study of the Quantum Hall Effect (QHE). The parental 3D GaAs is diamagnetic, 
and hence, the magnetic behavior is greatly different in 2D and 3D. 

Problem 10.3.1. Verify n0.41V 

10.4 Anisotropic Magnetoresistance in Copper 

10.4.1 Introduction 

If the Fermi surface is nonspherical, the MR becomes anisotropic. Cu has open orbits in k-space as 
represented by trace b in Figure 10.6 [10]. This open orbit contains positive and negative curvatures 
along the contour of equal energy. No physical electron can move along the orbit as we see presently. 
An “electron” (“hole”) is an elementary excitation which is generated on the positive (negative) side 
of the Fermi surface with the convention that the positive side contains the positive normal vector at 
the surface point, pointing in the energy-increasing direction. Thus, the “electron” (“hole”) has an 
energy higher (lower) than the Fermi energy and circulates counterclockwise (clockwise) when 
viewed from the tip of the applied magnetic field vector (a standard definition). Since the static 
magnetic field cannot supply energy, no physical electron can travel electron-like in one section of the 
energy contour and hole-like in another. Klauder and Kunzler [11] observed a striking anisotropic 
MR as reproduced in Figure 10.7 . The MR is over 400 times the zero-field resistance in some 
directions. We study an isotropic MR, applying kinetic theory and using the Fermi surface. 

Figure 10.6 The figure shows a closed orbit “a” in k-space that can be traced by the electron, and an 
open orbit “b” that extends over the two Brillouin zones and that cannot be traveled by an electron. 



( a ) 









Figure 10.7 The striking anisotropy of the MR in Cu after Klauder and Kunzler [11]. The [001] and 
[010] directions of the Cu crystal are shown. The current flows in the [100] direction. The magnetic 
field is in the plane (100); its magnitude is fixed at 1.8T~(=18kG), and its direction varies 
continuously from [001] to [010] as shown in the inset. 
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Electron transport has traditionally been dealt with using kinetic theory or the Boltzmann equation 
method. In the presence of a static magnetic field, the classical electron orbit is curved. Then, the 
basic kinetic theoretical model in which the electron moves on a straight line, breaks down. 
Furthermore, the collision term of the Boltzmann equation containing the scattering cross section 
cannot be written down. Fortunately, quantum theory can save the situation. If the magnetic field is 
applied, then the classical electron continuously changes from straight line motion at zero field to 
curved motion at a finite magnetic field. When the magnetic field is gradually applied, the energy of 
the electron does not change, but the resulting spiral motion always acts so as to reduce the magnetic 
fields. Hence, the total energy of the electron with its surrounding fields is less than the sum of the 
electron energy and the unperturbed field energy. The electron dressed with the fields is in a bound 
(negative energy) state, and it can resist the break-up. The guiding center of the circulation can move 
in all directions in the absence of the electric field. If a weak electric field is applied in a direction, 
the dressed electron whose position is the guiding center, preferentially jumps in the field direction, 
and generates a current. We can apply kinetic theory to the guiding center motion, and obtain a 
formula for the electrical conductivity [12, 13]: 
n c e 2 r 

n 0.461 g = ~ J 

where i\. is the density of the dressed electrons, e the charge, M* the magnetotransport (effective) 
mass, and x the relaxation time. The magneto trans port mass M* is distinct from the cyclotron mass 







m . Equation (10.461 can also be obtained by the Boltzmann equation method as shown earlier by 
Fujita et al. [12, 13]. In [12, 13], the dressed electron is identical to the composite fermion [14-17] 
used in the theory of the QHE [18, 19]. Briefly, the electron circulates around a finite number of flux 
quanta (fluxons) that are intact according to Onsager’s flux quantization hypothesis [20]. Applying 
relativity, we may see that the fluxons move around the electron. From this viewpoint, the dressed 
electron is considered to carry a number of fluxons. Thus, the dressed electron is composed of an 
electron and fluxons. The composite particle moves as a fermion (boson) if it carries an even (odd) 
number of fluxons [21-23]. The fiee-energy minimum consideration favors a population dominance of 
c-fermions, each with two fluxons, over c-bosons, each with one fluxon, in the experimental 
conditions at liquid helium temperatures. The entropy is much higher for the c-fermions than for the c- 
bosons. The magnetic oscillation, which occurs only with Fermionic carriers, is observed inCu. This 
experimental fact also supports the idea that the carriers in the magnetotransport are c-fermions. 

Pippard in his book, Magnetoresistance in Metal, [24], argued that the MR for the quasifree particle 
system vanishes after using the relaxation time approximation in the Boltzmann equation method. The 
MR in actual experimental conditions is found to be always finite. Equation (1 0.461 . in fact, contains 
the magnetotransport mass M* distinct from the electron mass m*. This fact alone makes the MR 
nonzero. 

The MR is defined by 

MR = jV = 

(10.47) a> a> 

where p(B) is the magnetoresistivity at the field magnitude B and p 0 = p(0), the resistivity at zero 
field. 

First, we regard the small necks on the Fermi surface as singular points (see Figure 10.7 1. There 
are eight singular points in total on it. If the magnetic field B is along the direction [001], then there 
are two planes (parallel to the plane that the two vectors [010] and [100] make) containing four 
singular points each. The same condition also holds when the field B is along the direction [010]. 
These conditions correspond to the major minima of MR in Figure 10.7 . Next, we consider the case 
in which the field B is along [Oil]. There are three planes perpendicular to [Oil] which contain two, 
four, and two singular points. This case corresponds to the second deepest minimum of MR. Lastly, 
the broad minima in data of MR correspond to the case where the field B is such that there are four 
planes perpendicular to B, each containing two singular points. In this case, there is a range of angles 
in which this condition holds. Hence, these minima should be broad. This singular-points model can 
explain the presence of these minima in MR. We propose a more realistic model in the following 
section. 

10.4.2 Theory 

We shall introduce the following theoretical model: 

(i) We assume that the magnetoconductivity a can be calculated based on (10.46) . The effective 
mass M and the relaxation time x are unlikely to depend on the direction of the field B. Only the 
conduction electron density r\. depends on the B-direction relative to the lattice. 

(ii) We assume that each neck (bad point) is represented by a sphere of radius a centered at the 








eight singular points on the ideal Fermi surface. When the magnetic field B is applied, the 
electron, then, circulates perpendicular to B in the k-space. If it hits the bad sphere, then it cannot 
complete the orbit, and cannot contribute to the conduction. 

In Figure 10.7 . we see the following five main features as the magnetic field B is rotated in the 
(100) plane from [001] to [Oil]. The MR has a deepest minimum (labeled a in Figure 10.7 1. a 
greatest maximum (b), a broad and flat minimum (c), a second greatest maximum (d), and a second 
deepest minimum (e). These features are repeated in the reversed order as the field is rotated from 
[Oil] to [010] due to the symmetry of the Fermi surface. We note that the three minima, a, c, and e 
were qualitatively explained earlier based on the singular-points model. There are eight bad spheres 
located on the Fermi surface in the direction {111) from the center O. 

We first consider case c. In Figure 10.8 . the Fermi surface viewed from [100] is shown. The four 
dark parts represent the nonconducting k-space volumes (bad volumes). The bad volume contains 
balls. There are four bad volumes here, and all the centers of the balls in the projected plane 
perpendicular to [100] lie on the circle of the radius R. This result is connected with the Fermi 
momentum p F = hk by 

Figure 10.8 The electron circulates perpendicular to the magnetic field B in the k-space. If the 
electron hits the bad ball of radius a, it will not complete circulation, and it does not contribute to the 
conduction. The four dark slices viewed from [100], each a width of 2a, contain nonconducting 
electrons. 




Each bad volume can be calculated by using the integration formulas (Problem 10.4.1): 
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where Xq = R cos 0 is the x-component of the ball center, and 0 is the angle shown in Figure 10.8 . The 

four centers of the balls lie on the circle, separated by n/2 in angle. We consider the sum of a pair of 
two bad volumes associated with the centers at 0 and 0 + n/2. Using U0.49T cos(0 + n/2) = - sin 0, 
and sin 2 0 + cos 2 0 = 1, we obtain 


(10.50) 

The volume of the other pair with the centers at 0 + n and 0 + 3n/2 contributes the same amount. 
Therefore, using U 0.481 . the total bad volume is 

„ 16 ,2 3 J 

(10-511 V ‘ = T’“ lk - 3*» ' 

The volume V c does not depend on the angle 0, supporting the broad minimum c observed in the 
experimental data. 

Similarly, we can calculate the bad volumes for the other cases, a and c, and the resultant volumes 
are 
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v-^v 
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Going from c to e, the four-slice volume monotonically changes to the three-slice volume. This 
means that the MR changes smoothly without taking a maximum From c to a, the four-slice volume 
changes monotonically to the two-slice volume. Thus, the present model generates no MR maxima. 
We shall give an explanation for the observed MR maxima in the next section. 

The conducting (good) volume is equal to the total ideal Fermi sphere volume ( 4 ) 7 rk 3 subtracted by 
the bad volume. The conduction electron density n c is given by the ideal density n multiplied by the 
ratio of the good volume over the ideal Fermi sphere volume: 

(4/ 3) jt fe 3 - V 

(10.54) Hc ~ n W3)jrfe3 ■ 

We call the inverse of the magnetoresistivity p(B) the magnetoconductivity g(B). Rewriting Ap/p 0 
in terms of o, we obtain 

Ap = 1/g-l/f/o <jq _ 

(10.55) “ P® * ’ 

where o 0 = l/p 0 is the zero-field conductivity, which can be calculated with the following formula: 

noe 2 ro 

f!0.56f °° m* 

where m is the cyclotron mass, and the suffix 0 denotes the zero-field quantities. 















Since in the experiments, the MR is very large, ~ 100, compared with unity, we may ignore -1 in 
(10.55) . Using tl 0.461 and (10.56) . we obtain 

A p M* to no 

(10.57) ' 

This indicates that the lower the magneto conduction electron density r^, the higher becomes the MR. 


10.4.3 Discussion 

In Figure 10.7 . we observe a MR maximum near [Oil]. The calculation using our model shows a 
mono tonic change of MR from the four-slice configuration c to the three-slice configuration e. We 
propose the following explanation. In going from c to e, an overlap of the bad volumes must occur, 
and the overlapping should not be line-sharp, as assumed in our model. Then the fluctuations, which 
must occur, generate dissipation. This results in a resistivity maximum in d. In going from the four- 
slice configuration c to the two-slice configuration a, an overlapping must also occur. Since, in this 
case, the two overlaps take place simultaneously, the fluctuations and the resultant resistivity increase 
should be greater, generating a MR maximum higher in b than that in d. 

In Figure 10.7 . we observe that the MR minimum varies from 80 to 340 (from e to c). This arises 
from the change in the conduction electron density r^. Now, we estimate the ratio of the bad ball 

diameter a over the Fermi momentum k, using the two MR values. With (10.51) . (10.53) . 110.541 . and 
(10.57) we obtain 

(4/3} jrfe 3 — (5/8) V c 340 

(10.58) (4/3)irU - V c " ur' 

This yields the ratio: 

— — 0 23 

(10.59) k 

which is reasonable. 

We see in Figure 10.7 that the MR rises quadratically with the field angle away from the minimum 
at e. Our model explains this behavior as follows. 

The center of one of the balls lies at 0 = n/2 for case e. We introduce a small deviation angle <J) such 
that 

_ JT 

(10.60) ' 2 ' 

Then, we have 

{ 4> + ji\ 

xn — Rcos -) = i? sin d> R<p . 

(10.61) V 2 j 

Using this and 110.491 . we calculate the bad volume near e and obtain 

( 10 . 62 ) Vr= l Vt + 2 ”‘ HV - 

Thus, this shows that the MR rises quadratically in the deviation angle (j) on the positive and negative 
sides. This is in agreement with the MR data as shown in Figure 10.7 . This quadratic behavior holds 
true for the regions between a and b. 

In our model a spherical ball was used for a bad volume. We may consider an ellipsoidal (two 


















parameters) model for the improvement. 

Other noble metals such as Ag and Au are known to have the Fermi surface with necks. If the bad 
balls are greater in relative size (a/k), our theory predicts more prominent MR. Experimental 
confirmation of this behavior is highly desirable. We suggest that the experiments be done below 1 K, 
where the phonon scattering is negligible and the MR minima become more visible. Only the minima, 
and not the maxima, contain important information about the Fermi surface. 

In conclusion, the spectacular angular dependence of the magnetoresistance in Cu can be explained 
by using the Drude formula based on the “neck” Fermi surface. The resistance minima can be used to 
estimate the “neck” size. The resistance (dissipation) maxima arise from the density fluctuations. 
Problem 10.4.1. Verify (10.49) . 


10.5 Shubnikov-de Haas Oscillations 


Oscillations in magnetoresistance, similar to the dHvA oscillation in the magnetic susceptibility, were 
first observed by Shubnikov and de Haas in 1930 [25]. These oscillations are often called the 
Shubnikov-de Haas oscillations. The susceptibility is an equilibrium property and can therefore be 
calculated by standard statistical mechanical methods. The MR is a nonequilibrium property, and its 
treatment requires a kinetic theory. The magnetic oscillations in both cases arise from the periodically 
varying density of states. We shall see that the observation of the oscillations gives a direct 
measurement of the magnetotransport mass M . The observation also gives quantitative information 
on the cyclotron mass m*. 

Let us take a dilute system of electrons moving in a plane. Applying a magnetic field B 
perpendicular to the plane, each electron will be in the Landau state with the energy (see Section 
9.3): 


/ 1 \ £ B 

£ = ( ra H- — I , ( 0 Z = — , jV| = 0, L 2 ,... , 

V 2; m* 


(10.63) 

where m* is the cyclotron mass. The degeneracy of the LL is given by (Problem 10.5.1) 

&BA , 

/1 a r a\ t—t > A = sample area . 

(10.64) 2ji fi h 

The weaker is the field, the more LLs, separated by hco c , are occupied by the electrons. In this Landau 
state the electron can be viewed as circulating around a guiding center. The radius of circulation 1 = 
(h/(eB)) 1/2 for the Landau ground state is about 250 A at a field B = 1.0T. If we now apply a weak 
electric field E in the x-direction, then the guiding center jumps and generates a current. 

Let us first consider the case with no magnetic field. We assume a uniform distribution of inpurities 
with the density n 2 . Solving the Boltzmann equation, we obtain the conductivity (Problem 10.5.2): 


(10.65) r \ d.*/ 2 m* 

where T is the energy (s)-dependent relaxation rate, 

( 10 . 66 ) m = 

where 0 = scattering angle and I(p, 0) = scattering cross section, and the Fermi distribution function 
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with P = (k B T) _1 and \x = chemical potential is normalized such that 
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( 10 . 68 ) Pm 1 

where the factor 2 is due to the spin degeneracy. We introduce the density of states, u(e), such that 
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(10.69) ( 2jl *) 2 
We can then rewrite 110.651 as 
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110.701 c 

The Fermi distribution function f(e) drops steeply near e = g at low temperatures: 

(10.71) k B r« fF . 

The density of states, £>(s), is a slowly varying function of the energy 8. For a 2D quasifree electron 
system, the density of states is independent of the energy 8. Then the Dirac delta-function replacement 
formula 

110.721 ~df = 


can be used. Assuming this formula, using 
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I <kD(r)c (-it) = j d£«( E )/< £ ), 


(10.73) 5 


and comparing 110.651 and 110.701 . we obtain 

OC 
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(10.74) o 

Note that the temperature dependence of the relaxation time x is introduced through the Fermi 
distribution function f(e). 

Next we consider the case with a magnetic field. We assume that a dressed electron is a fermion 
with magnetotransport mass M* and charge e. Applying kinetic theory to the dressed electrons, we 
obtain the standard formula for the conductivity: o = ne 2 x/M*. As discussed earlier, the dressed 
electrons move in all directions (isotropically) in the absence of the electric field. 

We introduce kinetic momenta: 


(10.75) n * = P* + eA * ' n i = Pt + eA v - 
The quasifree electron Hamiltonian ^ is 

mm u= ^ ni+n $ = ^ nK 

The variables (n x , n y ) are the same kinetic momenta introduced earlier in 19.341 . Only we are 
dealing with a 2D system here. After simple calculations, we obtain 

(10 77) d*d/7 x dyd/7 }1 = d^djj^dydpj. . 

We can represent the quantum states by quasiphase space element dxd/II x dyd/n y . The Hamiltonian % 














in (10.76) does not depend on the position (x, y). Assuming large normalization lengths (1^, Ly), A = 
L x L y , we can then represent the Landau states by the concentric shells in the LLJIy-space (see Figure 
10.9 ). having the statistical weight 


Figure 10.9 The circulation part of the Landau states is represented by the circular shell in n x II y - 
space. 
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with the energy separation hco c = A(nA(2m )) = II A ll/m . Equation tl 0.781 confirms that the LL 
degeneracy is eBA/(27ih), as stated in n0.64L 

Let us consider the motion of the field-dressed electrons (guiding center). We assume that the 
dressed electron is a fermion with magnetotransport mass M* and charge e. The kinetic energy is 
represented by 


?M* ) — 


■n i 


(10.79) ,t '*“ 2M* \“ x 1 “rf - 2M* 

According to Onsager’s flux quantization represented by (10.11) . the magnetic fluxes can be counted 
in units of ® 0 = e/h. The dressed electron is composed of an electron and two elementary fluxes 

(fluxons). A further explanation of the present model will be given later. 

Let us introduce a distribution function (p(IT, t) in the II X II y -space normalized such that 

,, 2 , f d 2 FI<p(Fl Xl Fly, f) = — = ft , 

(10.80) {2 **} 2 1 y ' A 

The Boltzmann equation for a homogeneous stationary system is 


(10 81) e(f + " * B) '^ = / isl TF^nn.emn')- v {n) ] , 

where 0 is the angle of deflection, that is, the angle between the initial and final kinetic momenta (II, 
IT). In the actual experimental condition the magnetic force term can be neglected. Assuming this 
condition for now, we obtain the Boltzmann equation for a field-free system.^* Hence, we obtain the 
same conductivity formula 110.651 with m* replaced by M*, yielding 110.461 . 

As the field B is raised, the separation hot> c becomes greater and the quantum states are bunched 
together. The density of states should contain an oscillatory part: 
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where (j) 0 is a phase. Since 

1 (weak field) , 

(10.83) [ * 

the phase cj) 0 will be dropped hereafter. Physically, the sinusoidal variations in flO.821 arise as 

follows. From the Heisenberg uncertainty principle (phase space consideration) and the Pauli 
exclusion principle, the Fermi energy e F remains approximately constant as the field B varies. The 

density of states is high when e F matches the N L th level, while it is small when e F falls between 
neighboring LLs. 

If the density of states, u(s), oscillates violently in the drop of the Fermi distribution function f(s) = 
( e P(e- F ) + i)-i 5 one cannot use the delta-function replacement formula. The use of (TO.72! is limited to 
the case in which the integrand is a smooth function near 8 = p. The width of df/de is of the order k B 

T. The critical temperature T c below which the oscillations can be observed is 


(10.84) k R r - . 

Below the critical temperature T < T c , we may proceed as follows. Let us consider the integral 
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For the temperature satisfying (3s = e/(k B T) » 1, we can use the same mathematical steps as going 
from flO.171 to tl0.201 and obtain 

= cos(2jie F /(*«J) 
flO.861 * b sinh(23i 2 M*fe|jr/(AeB)) ' 

Here we used 

(10.87) M V(T = 0) = ra*t F =ip!. 

which follows from the fact that the Fermi momentum p F is the same for both dressed and undressed 
electrons. 

In summary, (i) the SdH oscillation period is s F /(ho) c ), which is the same for the dHvA oscillations. 
This arises from the bunching of the quantum states, (ii) The envelope of the oscillations 
exponentially decreases like [sinh(27t 2 M*k B T/(heB))] _1 . Thus, if the “decay rate” 6 defined through 


-6)—PS*) 


( 10 . 88 ) 


is measured carefully, the magnetotransport mass M can be obtained directly through M = 
ehS/(27r 2 k B T). This finding is quite remarkable. For example, the relaxation rate x _1 can now be 

obtained through f 10.461 with the measured magnetoconductivity. All electrons, not just those excited 
electrons near the Fermi surface, are subject to the E-field. Hence, the carrier density r\. appearing in 

tl0.461 is the total density of the dressed electrons. This r\. also appears in the Hall resistivity 
expression: 

_ £"h _ _ B 

(10.89) " _ J “ en c i > d ~ en c ' 













where the Hall effect condition E H = v d B, v d = drift velocity, was used. 

In cyclotron motion the electron with the effective mass m circulates around the magnetic fluxes. 
Hence, the cyclotron frequency co c is given by eB/m . The guiding center (dressed electron) moves 

with the magnetotransport mass M , and therefore, this M appears in the hyperbolic sine term in 
fl 0.86V 

In 1952 Dingle [26] developed a theory for the dHvA oscillations. He proposed to explain the 
envelope behavior in terms of a Dingle temperature T D such that the exponential decay factor be 


(10.90) 
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k — constant. 


Instead of the modification in the temperature, we introduced the magnetotransport mass M* to explain 
the envelope behavior. The susceptibility £, is an equilibrium property, and hence, £, should be 
calculated without considering the relaxation mechanism In our theory, the envelope of the 
oscillations is obtained by taking the average of the sinusoidal density of states with the Fermi 
distribution of the dressed electrons. There is no place where the inpurities come into play. The 
theory may be checked by varying the impurity density. Our theory predicts little change in the clearly 
defined envelope. The scattering will change the relaxation rate t _ 1 and the magnetoconductivity for 
the center of the oscillations though 00.461 . 

In the present theory, the two masses m and M were introduced corresponding to two physical 
processes: the cyclotron motion of the electron and the magnetotransport motion of the dressed 
electron. The dressed electrons are present whether the system is probed in equilibrium or in 
nonequilibrium as long as the system is subjected to a magnetic field. The presence of the c-particles 
can be checked by measuring the susceptibility or the heat capacity of the system All dressed 
electrons are subject to the magnetic field, and hence, the magnetic susceptibility % is proportional to 
the carrier density n c , although the % depends critically on the Fermi surface. This explains why the 

magnetic oscillations in the conductivity and the susceptibility are similar. 

Problem 10.5.1. The Landau levels are highly degenerate. Derive that the degeneracy, that is, the 
number of electrons that can occupy each Landau level, is given by eBA/(27ih), where A is the sample 
area perpendicular to the magnetic field B. 

Problem 10.5.2. Derive the conductivity formula 110.651 . 
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1) The Boltzmann equation for a magnetic field-free system, where a small constant electric field E 
is applied to the positive x-axis, is given by 
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Chapter 11 


Quantum Hall Effect 


Major experimental facts associated with the Quantum Hall Effect (QHE) and a brief theoretical 
survey are given in this chapter. A theory of treating integer and fractional QHE is developed in a 
unified manner based on the model, in which the phonon exchange between the electron and the 
elementary magnetic flux (fluxon, half-spin fermion) binds an electron-fluxon composite. The center 
of mass of the composit boson moves with the linear dispersion relation: s = (2/7i)v F q, v F = Fermi 

speed, q = momentum The 2D system of free composite bosons undergoes a Bose-Einstein 
condensation at k B T c = 1.241 hv F no 1/2 , where ly, is the density. The QHE state at the Landau-level 
occupation ratio v = P/Q, integer P, odd integer Q, is shown to be the superconducting state with an 
energy gap generated by the condensed composite (c-)bosons, each containing one electron and Q 
fluxons and carrying the fractional charge e/Q. This state is formed by phonon exchange from the c- 
fermions, each with Q - 1 fluxons, occupying the lowest P Landau states. The c-boson density iiq is 

one Pth the electron density ly, making the QHE state less stable with increasing P. In Jain’s theory of 

fractional hierarchy, the effective magnetic field for the c-fermion vanishes at the sermonic fraction 
P/Q, even Q. This idea is extended to the c-particle (boson, fermion) which moves field-free in 2D at 
v = P/Q. The plateau value (Q/P)(h/e 2 ) at the fractional ratio v = P/Q, odd Q, directly indicates the 
fractional charge (magnitude) e/Q for the composite with Q fluxons. 

11.1 Experimental Facts 

In 1980 von Klitzing et al. [1] reported the discovery of the integer Quantum Hall Effect (QHE). In 
1982 Tsui et al. [2] discovered the fractional QHE. The GaAs/AlGaAs heterojunction is shown in 
Figure 11.1 . The electrons are trapped in GaAs near the interface within a distance of the order of 
100 A. The QHE measurements are normally carried out in the geometry shown in Figure 11.2 . The 
Hall field E H is the Hall voltage V H divided by the sample width W. The Hall resistivity p H is 

defined by the ratio: 

Figure 11.1 The GaAs/AlGaAs heterojunction. The electrons are localized within less than 100 A of 
the interface. The shaded area represents the electron distribution. 







Figure 11.2 Hall effect measurements. The Hall voltage V H is generated perpendicular to the current 
J and the magnetic field B. 



is the current density defined as the current J over the sample length L. The external electric field E is 
defined as the voltage V over the length L. Equation (11.2) defines the resistivity p according to 
Ohm’s law. Figure 11.3 represents the data reported by Tsui [3] for the Hall resistivity p H and the 

resistivity p in GaAs/AlGaAs at 60 mK. The Quantum Hall (QH) states at the Landau-Level (LL) 
occupation ratio, also called the filling factor, v = 1, 2, ..., 6 are visible in Figure 11.3 . Clearly, each 
QH state with a Hall resistivity plateau (horizontal stretch) is superconducting (zero resistance). In 
the normal metal the Hall resistivity p H is linear in the applied magnetic field B (Problem 11.1.1). 

The Hall coefficient R H is defined as the ratio E H /jB: 


Figure 11.3 Observed QHE in the GaAs/AlGaAs heterojunction at 60 mK, after Tsui [3]. The Hall 
resistivity p H and the resistance p are shown as a function of the magnetic field B in tesla. 
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(11.3) ' Jfi B 

The plateau stability arises from the Meissner effect, which expels the excess weak magnetic field 
measured relative to the center position (field) of the Hall resistivity plateau from the 
superconducting body. The stability against the electric field arises from the gap in the elementary 
excitation energy spectrum, which will be explained later. The superconducting state with an energy 
gap is sometimes referred to as the incompressible quantum fluid state in the literature [4-9]. We 
avoid this characterization in the present text since no pressure is applied to the sys-tem Besides, 
some research groups have investigated the pressure dependence of the QH state. In such a case the 
term incompressible quantum fluid becomes confusing. The superconducting state (no resistance) 
characterizes the state more directly. The plateau heights are quantized in units of h/e 2 , h = planck 
constant, e = electron charge (magnitude), see Figure 11.3 . Each plateau height is material- and 
shape-independent, indicating the fundamental quantum nature, the boundary-condition independence, 
and the stability of the QH state. 

Figure 11.4 represents the QHE data reproduced after Willett et al. [10]. The data indicate a 
remarkable similarity among and between the integer QHE at v = P, P = 1, 2, 3, ..., and the fractional 
QHE at v = P/Q, positive integer (numerator) P, P = 1,2, ... and odd integer (denominator) Q, Q = 1, 
3, ... Compare in particular the QH state at v = 2 and 2/3, suggesting a single cause for both integer 
and fractional QHE. The QHE is temperature-dependent, including the superconducting transition at a 
critical temperature T c , see below. 

Figure 11.4 The Hall resistivity p H (= p xy ) in h/e 2 and the resistivity p(= p^) in GaAs/AlGaAs at 85, 

150 mK are plotted as a function of the magnetic field B in tesla, reproduced after Willett et al. [10], 
The high-field p trace is reduced in amplitude by a factor of 2.5. The upper numbers indicate the 
Landau-level occupation ratio v. 













Landau level occupation ratio 



The resistivity data under fields of up to 50 T from Leadley et al. [11] are shown in Figure 11.5 . 
The QH state below T c is recognized by the Hall resistivity plateau and zero resistivity. We see that 

the critical temperature T c at v = 1/3 is higher than 0.59 K and lower than 1.40 K. The T c at v = 2/7 is 
lower than 0.59 since the resistivity p is finite. 

Figure 11.5 Resistivity p in GaAs/AlGaAs as a function of magnetic field B in the units of tesla, after 
Leadley et al. [11]. 
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In 2007 Novoselov et al. [12, 13] discovered room-temperature QHE in graphene, T = 300 K, B = 
29 T. It is a theoretical challenge to explain the superconductivity and QHE at such high temperatures. 
Problem 11.1.1. Prove that p H in (1 1.11 is linear in B. 
































































11.2 Theoretical Developments 

The 1985 Nobel Prize was awarded to von Klitzing, the discoverer of the integer QHE. The 1998 
Nobel Prizes were shared to Tsui, Stormer (experimental discovery) and Laughlin (theory) for their 
contribution to the fractional QHE. 

The prevalent theories [4, 14, 18, 20-23] based on the Laughlin wavefrmction in the Schrodinger 
picture deal with the QHE at 0 K and immediately above [18, 19]. The system ground state, however, 
does not carry a current. To interpret the experimental data it is convenient to introduce composite (c- 
) particles (bosons, fermions). The c-bosons (fermions), each containing an electron and an odd 
(even) number of fluxons were introduced by Zhang et al. [20] and others (Jain [14-17]) for the 
description of the fractional QHE (Fermi liquid). All QH states with distinctive plateaus in p H are 

observed below around 0.5 K. It is desirable to treat the QHE below and above T c in a unified 

manner. The extreme accuracy (precision ~ 1CT 8 ), in which each plateau is observed, means that the 
current density j must be computed exactly with no averaging. In the prevalent theories [4-9] the 
electron-electron interaction and Pauli’s exclusion principle are regarded as the cause for the QHE. 
Both are essentially repulsive and cannot account for the fact that the c-particles are bound, that is, 
they are in negative-energy states. Besides, the prevalent theories have limitations: 

• The zero temperature limit is taken at the outset. Then the question why QHE is observed below 
0.5 K and not at higher temperatures in GaAs/AlGaAs cannot be answered. We should have a 
theory for all temperatures. We should find the critical temperature T c , below which the 

superconducting state appears. 

• The high field limit is taken at the outset. The integer QHE is observed for small integer P only. 
The question why the QHE is not observed for large P (weak field) cannot be answered. We 
better describe the phenomena for all fields. 

• The p H value (Q/P)(h/e 2 ) is obtained in a single stroke. To obtain the Hall resistivity p H we need 
two separate measurements of the Hall field E H and the current density j. We must calculate (E H > 
j) and take the ratio: p H = E H /J. 

There is a remarkable similarity between the QHE and the High-Temperature Superconductivity 
(HTSC), both occurring in 2D systems. The major superconducting properties observed in the HTSC 
are (a) zero resistance, (b) a sharp phase change, (c) the energy gap below T c , (d) the flux 

quantization, (e) Meissner effect, and (f) Josephson effects. All of these have been observed in the 
GaAs/AlGaAs interface. We regard phonon-exchange attraction as the cause of both QHE and 
superconductivity. Starting with a reasonable Hamiltonian, we calculate everything using standard 
statistical mechanics. We develop a unified theory of the integer and fractional quantum Hall effects 
in Sections 11.3.3 and 11.3.4. 

11.3 Theory of the Quantum Hall Effect 


11.3.1 Introduction 



Experimental data by Willett et al. [10], reproduced in Figure 11.4 . show that the Hall resistivity p H = 
E H /j (E h : Hall field, j: current density) in the GaAs/AlGaAs heterojunction at extremely low 

temperatures (85-150 mK) has plateaus at various fractional LL occupation ratios (filling factor) v = 
P/Q with positive integer P and odd integer Q, where the resistivity p = E/j (E: applied field) 
vanishes. In particular at v = 1/3, the plateau in p H and the drop in p (not shown) are as distinctive as 

the integer QHE plateau and zero resistivity at v = 1, indicating the superconducting state with an 
energy gap. The plateau heights are quantized in units of h/e 2 (h: Plank constant, e: electron charge 
{magnitude}). The ground state of the GaAs/AlGaAs hetero junction at v = 1/Q, can be described in 
terms of the Laughlin wavefimction [4, 18]. Laughlin [18, 19] pointed out a remarkable similarity 
between the QHE and the cuprate superconductivity, both occurring in 2D. Zhang et al. [20] discussed 
the fractional QHE in terms of the c-bosons, each made up of an electron and an odd number of 
fluxons. The fermionic nature of the fluxon will be discussed below. 

The data shown in Figure 11.4 indicate that the Hall resistivity p H is linear in B at v = 1/2, 

exhibiting a Fermi-liquid state. Jain [14-17] discussed this state and the QHE in terms of the c- 
fermions, each made up of an electron and two fluxons. The c-particle appearing at 0 K must have a 
lower energy than the constituents (electrons, fluxons), which requires an attractive interaction 
Hamiltonian. Following the Bardeen-Cooper-Schriefer (BCS) theory of superconductivity [24], 
where the Cooper pair [25] is formed by the phonon-exchange attraction, we regard phonon-exchange 
attraction as the cause of the QHE. The most remarkable feature of Laughlin’s theory is his 
introduction of the fractional charge for the elementary excitation. The particle-number nonconserving 
processes, such as the phonon exchange and the formation of the c-particle, can best be treated using 
the second-quantization (field-theoretical) formulation [26]. The Fermi statistics of the electrons 
concisely in terms of the anticommutation rules for the creation and annihilation operators. 

All QHE states with distinctive plateaus in p H in GaAs/AlGaAs are observed below around 0.5 K 
[18, 19]. We regard this temperature as the critical temperature T c for the QHE. The T c can be 

recognized by the Hall resistivity plateau. It is desirable to treat both quantum and classical Hall 
effects below and above T c in a unified manner. We shall develop a finite temperature theory, starting 

with a reasonable model Hamiltonian and calculating everything using standard quantum statistical 
methods. The extreme accuracy (precision ~10 -8 ), in which each plateau value is observed, means 
that the current density j must be computed without averaging. We accomplish this using simple 
kinetic theory and calculating j in terms of the condensed c-bosons. 

The countability concept of the fluxons, known as the flux quantization: 


„ Nh N t 
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where A = sample area, N = integer, O 0 = h/e = flux quantum, was originally due to Onsager [27]. 

This idea and Onsager’s formula are routinely used in the analysis of the de Haas-van Alphen effect 
and the determination of the Fermi surface. 

We now discuss the quantum statistics of the fluxons. The magnetic (electric) field is an axial 
(polar) vector and the associated fluxon (photon) is a half-spin fermion (full-spin boson), which is in 
line with Dirac’s theory [26] that every quantum particle having the position of an observable is a 
half-spin fermion. The magnetic (electric) field is an axial (polar) vector and the associated fluxon 




(photon) is a half-spin fermion (full-spin boson). The magnetic (electric) flux line cannot (can) 
terminate at a sink, which also supports the fermionic (bosonic) nature of the associated fluxon 
(photon). No half-spin fermion can annihilate by itself because of angular momentum conservation. 
The electron spin originates in the relativistic quantum equation (Dirac’s theory of electron) [26]. The 
discrete (two) quantum numbers (o z = ±1) cannot change in the continuous nonrelativistic limit, and 

hence the spin must be conserved. The countability and statistics of the fluxon are the fundamental 
particle properties, and hence they cannot be derived starting with a Hamiltonian. We postulate that 
the fluxon is a half-spin fermion with zero mass and zero charge. Only half-spin fermions can form 
composites of definite quantum statistics. 

We assume that the magnetic field B is applied perpendicular to the interface. The 2D Landau level 
energy: 



with the states (N L , ky), N L = 0, 1,2, ..., have a great degeneracy (hidden variable ky). 

We shall develop a microscopic theory of the QHE in analogy with the theory of the cuprate 
superconductivity [28]. 

11.3.2 The Model 

The center of mass (CM) of any composite particle (c-particle) moves as a fermion (boson). The 
eigenvalues of the CM momentum are limited to 0 or 1 (unlimited) if it contains an odd (even) number 
of elementary fermions. This rule is known as the Ehrenfest-Oppenheimer-Bethe (EOB) rule [29, 
30], This rule states that a c-particle with respect to the center of mass motion moves as a fermion 
(boson) if it contains an odd (even) number of elementary fermions. Hence, the CM motion of the 
composite containing an electron and Q fluxons is bosonic (fermionic) if Q is odd (even). The system 
of c-bosons condenses below some critical temperature T c and exhibits a superconducting state while 

the system of c-fermions shows a Fermi liquid behavior. 

A longitudinal phonon, acoustic or optical, generates a density wave, which affects electron 
(fluxon) motion through charge displacement (current). The exchange of a phonon between electron 
and fluxon generates an attractive transition. The phonon exchange between two electrons generates a 
transition in the electron states with the effective (attractive) interaction, see 14,941 : 

7 tlOi„ 
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where e k is the electron energy, hco q the phonon energy, and V q the electron-phonon interaction 

strength. Let us assume an interface formed in (001). The planar arrays of Ga 3+ (A) and As 3 "(B) are 
located alternately in equilibrium along {100) as ABA’B’AB. A longitudinal phonon, acoustic or 
optical, running in (100) can generate a density wave which affects the electron (fluxon) motion by the 
ionic charge displacement (current). The same condition also holds in (010). The lattice wave 
proceeding in the (001) plane can be regarded as a superposition of the waves proceeding in {100) 
and {010), and hence the associated phonon can generate a 2D charge-density wave, and the electron 
(fluxon)-phonon interaction. 

The exchange of a phonon between an electron and a fluxon also generates a transition in the 






electron states with the effective (attractive) interaction: 

, ^ tO ^ 
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where V' q (V q ) is the fluxon-phonon (electron-phonon) interaction strength. The Landau oscillator 
quantum number N L is omitted; the bold k denotes the momentum (ky) and the italic k(= |k|) the 

magnitude. There are two processes, one with the absorption of a phonon with momentum q and the 
other with the emission of a phonon with momentum -q, see Figure 11.6a and b, which contribute to 
the effective interaction with the energy denominators (8| k+q | — e k - hco q ) _1 and (£| k+q | - s k - hco q ) _1 , 

generating (11.7) . The interaction is attractive (negative) and most effective when the states before 
and after the exchange have the same energy (S| k+q | - s k = 0) as in the degenerate LL. 


Figure 11.6 (a,b) Two phonon (wavy line) exchange processes, which contribute to the effective 
interaction in (11.71 . 




BCS [24] assumed the existence of Cooper pairs [25] in a superconductor, and wrote down a 
Hamiltonian containing the “electron” and “hole” kinetic energies and the pairing interaction 
Hamiltonian with the phonon variables eliminated, see Section 7.4. We start with a BCS-like 
Hamiltonian ^ for the present model system: 

«=£'£«+£'£« 
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where n^® = c ks ®''c ks ® is the number operator for the “electron” (1) (“hole” (2), fluxon(3)) at 
momentum k and spin s with the energy s k ®, with annihilation (creation) operators c(c^") satisfying 

the Fermi anticommutation rules: 

LP) Jj)tl _ , Jilt-! 1 ) _ x . f J*l C U) \ _ 0 

(119) | Cjb ’ 1 = CjtsCjt ' 3 ' + C k's fC ks ~ °k,k ,O *S 0 i,J ' C JtV| — U ‘ 

The fluxon number operator n ks (3) is represented by a ks i a ks with a(a^) satisfying the anticommutation 
rules: 


(11 10 ) \ ak “’ ’ { a ka, &k's'} — 0 . 

The phonon exchange can create electron-fluxon composites, bosonic or fermionic, depending on 
the number of fluxons. The center of mass of any composite moves as a fermion (boson) if it contains 
odd (even) numbers of elementary fermions. We call the conduction-electron composite with an odd 
(even) number of fluxons the composite (c-) boson (c-fermion). The electron (hole)-type c-particles 










carry negative (positive) charge. We expect that electron (hole)-type Cooper-pair-like c-bosons are 
generated by the phonon-exchange attraction from a pair of electron (hole)-type c-fermions. The pair 
operators B are defined by 

R mt _ r rn .{lit r < 2> _ r W m 

(11.11) t q ,B - C k+ql2,a C -k+qjl- S ' °kqj ~ C -Jt + ,/2,-s +*/2,1 ' 

The prime on the summation in (11.8) means the restriction: 0 < e lc / j) < hco D , Q D = Debye frequency. 
The pairing interaction terms in til.81 conserve the charge. The term -v 0 B k ’ q / l,t B kq / l) , where v 0 = 
|V q V q |(ha) 0 A) _1 , A = sample area, is the pairing strength, generates a transition in the electron-type c- 
fermion states, see Figure 7.16a . Similarly, the exchange of a phonon generates a transition between 
the hole-type c-fermion states, see Figure 7.16b . represented by _ v 0 B k ’ qs (2 f'B kqs (2 f'. The phonon 
exchange can also pair-create (pair-annihilate) electron (hole)-type c-boson pairs, and the effects of 
these processes are represented by -v 0 B k > qs ( 1 ^B kqs (2 ^ (-v 0 B kqs (1) B kqs (2 (), see Figure 7.18a.b . 

The pairing interaction terms in (11.8) are formally identical to those in 17.231 . Only we deal here 
with c-fermions instead of conduction electrons. We denote creation and annihilation operators by the 
same symbols c. 

We now extend our theory to include elementary fermions (electron, fluxon) as members of the c- 
fermion set. The Cooper pair (electron, electron) is regarded as the c-boson. We can then treat the 
superconductivity and the QHE in a unified manner. The c-boson containing one electron and one 
fluxon can be used to describe the integer QHE as we see later in this chapter. 

We shall use the Hamiltonian ^ in 111.81 and discuss both integer and fractional QHE. We also use 
the same Hamiltonian to describe the QHE in graphene in the following chapter. 


11.3.3 The Integer QHE 


We start with the Hamiltonian ^ in 111.81 . We regard conduction electrons and fluxons as c-fermions 
(building blocks). The pair operators, 


Bp** = & 


111.121 D kq = 


B k'i - a ~ 


. , nC U] 
-k+^2^i +q f2 


are introduced, where we drop the spin indices. The upper indices j = 1(2) mean “electron” (“hole”). 
The interaction Hamiltonians 


j**. 


pair-create (pair-annihilate) electron-type c-bosons and hole-type c-bosons. The c-bosons containing 
one electron and one fluxon, formed at v = 1, will be called the fundamental (f) c-bosons. See Figure 
7.18a.b . The c-bosons can be bound and stabilized by the interaction Hamiltonians -v 0 B k > q ®^B kq ®. 

The fc-bosons (fundamental c-bosons) can undergo the Bose-Einstein condensation (BEC) below the 
critical temperature T c . The fc-bosons condensed at a momentum along the sample length are shown 

in Figure 11.7a . Above T c , they can move in all directions in the plane as shown in Figure 11.7b . 


Figure 11.7 ±fc-bosons at 0 K (or below T c ) (a) and noncondensed fc-bosons above T c (b). 
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First take the -fc-boson. The ground state energy w 0 can be calculated by solving the Cooper-like 
equation [25]: 

h > o '/'{*) = n l I'{k) - - 77-777 f d 2 k''l'{k'), 

(11.13) (2 x*) 2 J 

where T is the reduced wavefiinction for the stationary fc-bosons; we neglected the fluxon energy. We 
obtain after simple calculations 

u* =_ ~ hc0j} _<0 

(11.14) exp(KP 0 )-i) -1 

where d 0 = xj(s f ) is the density of states per spin. Note that the binding energy |w 0 | does not depend on 
the “electron” mass. Hence, the ±fc-bosons have the same energy w 0 . 

At 0 K only stationary fc-bosons are generated. The ground state energy W 0 of the system of fc- 
bosons is 


( 11 . 15)^0 = 2 ^, 

where N 0 is the -(or+) fc-boson number. 

At a finite T there are moving (noncondensed) fc-bosons, whose energies w q ® are obtained from 
[25, 32] 

which is reduced to (11.12) in the small momentum (magnitude) q limit (Problem 11.3.1). For small 
q, we obtain 

where v F ® = (2s F /n^) 1/2 is the Fermi speed. The brief derivation of the formulas (11.16) and (11.17) 

is given in Appendix A.6. The energy w q ® depends linearly on the momentum q. The linear 

dispersion relation for the 3D Cooper pair was obtained by Cooper (unpublished), and was recorded 
in Schriefifer’s book [31]. By the way, the same linear dispersion relation (11.221 also holds for the 
2D Cooper pair in high T c cuprate superconductors [28]. This relation was clearly observed in the 

lowest energy region by Lanzara et al. [33] with Angle-Resolved Photoemission (ARPS) 
Spectroscopy. Formula (11.19) for the critical temperature, see below, also holds for the cuprates 
[28]. 

















The system ofifree massless bosons undergoes a BEC in 2D at the critical temperature [28]: 

(11.18) tflT c = 1.954 ticn'l* , 

where c is the boson speed, and n the density. The brief derivation is given in Appendix A.6. This is 
not a violation of Hohenberg’s theorem that there be no long-range order in 2D, which is obtained 
using an f-sumrule (mass conservation). The theorem does not hold for massless bosons. Substituting 
c = 2/7iv F in (11.18) . we obtain [32] (Problem 11.3.2) 


fe E T c = 1.24* , 


N 0 

n (j = — 
A 


(11-19) 

The interboson distance R 0 = 1/yHo calculated from this equation is 1.24hv F /(k B T c ). The boson size r 0 
calculated from (11.18) . using the uncertainty relation (q max r 0 ~ h) and |w 0 | ~ k B T c , is r 0 = (2/7i)hv F (k B 
T c ) _1 , which is a few times smaller than R 0 . Thus, the bosons do not overlap in space, and the free 
boson model is justified. 

Let us take GaAs/AlGaAs. We assume m* = 0.067 nig, nig = electron mass. For the electron density 
10 11 cm -2 , we have v F = 1.36 x 10 6 cm s -1 . Not all electrons are bound with fluxons since the 
simultaneous generation of ± fc-bosons is required. If we assume = 10 10 cm -2 , we obtain T c = 1.29 
K which is reasonable. The precise measurement of T c may be made in a sample of constricted 
geometry. The plateau width should vanish at T c since s g = 0. 

In the presence of the Bose condensate below T c the unfluxed electron carries the energy 

eP = A j> 


_l_ j 2 , where the quasielectron energy gap A is the solution of 

tr(D D 


1 VoVq 

(11.20) o 


/ 


dr 


1 


(r 2 + A 1 ) 1 / 2 


[l + exp (-0 (c 2 + ^l 2 ) 1 ' 2 )] 


— 1 


where p = (k B T) _1 and vq = u(s F ) is the density of states per spin. Note that the gap A depends on T. 
At T c there is no condensate and hence A vanishes. The moving fc-boson below T c with the 
condensate background has the energy ^> q , obtained from 

XhiutL _ vU) 


WqW(k r q)= \fy{k,q) — — 

(11.21) 1 ( 2:tA ) 

where E® replaced 8® in (11.16) . We obtain 


ilk ‘ 


'i'(k'.q) • 
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= J \q\ = + f g + — 


(j) 
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dc 


m.22) 

where ^(T) is determined from 

ft fy D 

1 = 'DqVq f 

(11.23) o 
The energy difference, 

(11.24) ^olT) “ w o = f 8 ( r ) > 0 - 

represents the T-dependent gap between the moving and stationary fc-bosons. The energy is 


\wq\ + {f 2 + A 2 ) 1 ! 2 












negative. Otherwise, the fc-boson should break up. This limits s g (T) to be less than |w 0 |. Hence, the 
energy gap e g (T) is |w 0 | at 0 K. It declines to zero as the temperature approaches T c from below. 

The fc-boson, having the linear dispersion relation (11.17) or (11.22) . can move in all directions in 
the plane with the constant speed (2/7i)v F ®. The supercurrent is generated by ipfc-bosons 


monochromatically condensed, running along the sample length, see Figure 11.7a . The supercurrent 
density (magnitude) j, calculated by the rule: j = (charge) x (carrier density) x (drift velocity), is 
given by 


/ = e*n 0 Ud — e* no — 
OL25) J °jt 


( 1 ) ( 2 ) 
V'f - 


where e is the effective charge. The induced Hall field (magnitude) E H equals v d B. The magnetic 


flux is quantized: 

B — n<j> <P 0 ,n<p — fluxon density , 


(11.26) 

Hence we obtain 


<I>q — {hje) = magnetic flux (fluxon) . 


E h v>d£ o 1 / h\ 

(11.27) J e n 0 e* e*n Q \ej 

For e* = e, n^ = nQ, p H = h/e 2 , explaining the plateau value observed. 

The supercurrent generated by equal numbers of rpfc-bosons condensed monochromatically is 
neutral. This is reflected in the calculations in (1 1.251 . The supercondensate whose motion generates 
the supercurrent must be neutral. If it has a charge, it would be accelerated indefinitely by the external 
field because the impurities and phonons cannot stop the supercurrent growing. That is, the circuit 
containing a superconducting sample and a battery must be burnt out if the supercondensate is not 
neutral. In the calculation of p H in (11.27) . we used the unaveraged drift velocity difference (2/7i)|v F (1) 
- v F ^|, which is significant. Only the unaveraged drift velocity cancels out exactly from 
numerator/denominator, leading to an exceedingly accurate plateau value. Thus, we explained why the 
precise plateau value in h/e 2 can be observed in experiment. 

The conduction electrons are scattered by phonons and inpurities. The drift velocity v d depends on 
the scattering rate, and p H is proportional to the field B while p is finite. Immediately above T c the 

noncondensed fc-bosons moving in all directions, see Figure 11.7b . dominate the magnetotransport, 
and should show a non-Fermi liquid behavior. 

The resistivity p at v = 1 exponentially rises on both sides, see Figure 11.4 . This can be explained 
as follows. The excited fc-boson has an energy gap e g . Hence, its density has an Arrhenius-law 

Boltzmann factor 


exp 

(11.28) 


(fc*n J' 


B c = ft — Bi f 


where s g (|B c |, T) approaches zero at the super-to-normal boundary. After the field difference from the 
center B 2 = i^Oq = n^h/e), B c , passes the boundary, p returns to normal since the carrier density loses 
the Boltzmann factor. 

















In summary the moving fc-boson below T c has an energy gap e g , generating the stable plateau. The 
fc-bosons condensed at a finite momentum account for the supercurrent with no resistance and the 
plateau value equal to h/e 2 . 

The theory developed in Section 11.3 can simply be extended to the integer QHE. The field 
magnitude is smaller at v = P = 2, 3, ... The LL degeneracy is proportional to B, and hence P LLs 
must be considered. First consider the case P = 2. Without the phonon-exchange attraction the 
electrons occupy the lowest two LLs with spin. The electrons at each level form fc-bosons. See 
Figure 11.8a and b. The fc-boson density n 0 at each LL is one-half the density at v = 1, which is equal 

to the electron density rig fixed for the sample. Extending the theory to a general integer, we have 

Figure 11.8 (a) Landau levels for electron; (b) fc-boson energy bands. The c-fermions which fill up 
the lowest two LLs, form the QH state at v = 2 after the phonon-exchange attraction and the BEC of 
the fc-bosons. 
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This means that both T c (°c n 0 1/2 ) and s g are smaller, making the plateau width (a measure of s g ) 

smaller in agreement with experiments, see Figure 11.4 . The fc-boson has a lower energy than the 
conduction electron. Hence, at the extreme low temperatures the supercurrent due to the condensed fc- 
bosons dominates the normal current due to the conduction electrons and noncondensed fc-bosons, 
giving rise to the dip in p. All dips should reach zero if the temperature is further lowered, see Figure 
11.3 . where the data at 60 mK reported by Tsui [3] are shown. The QHE states with small integers (v 
= 1, 2, ..., 6) are clearly visible. Other QHE states with greater P are overshadowed by the 
neighboring QHE states with small P. 

Problem 11.3.1. Verify that (11.16) is reduced to (11.13) in the small q-limit. 

Problem 11.3.2. Derive (11.19) . 

11.3.4 The Fractional QHE 

Let us consider a general case v = P/Q, odd Q. Assume that there are P sets of c-fermions with Q - 1 
fluxons, which occupy the lowest P LLs, see Figure 11.9a . where we choose Q = 3, P = 2. The c- 
fermions subject to the available B-field form c-bosons with Q fluxons, see Figure 11.9b . Note the 
similarity between Figures 11.8 and 11.9 . In this configuration the c-boson density r^ is given by 

(11.29) and the fluxon density n^ is given by 



























Figure 11.9 The QHE state at v = 2/3. (a) The c-fermions, each with two fluxons, occupy fully the 
lowest two LLs; (b) With the phonon exchange, c-bosons, each with three fluxons, are created and 
occupy the energy bands with the energy gap e 2 . 
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<41.301 n0 ~ T * 

Using (41.27L U 1,291 . and HI.301 . we obtain 

E h v d h Q k 

(11.31) J e*n 0 v u e P e 2 

as observed. We see that the integer Q indicates the number of fluxons in the c-boson and the integer P 
the number of LLs occupied by the parental c-fermions, each with Q - 1 fluxons. To derive the last 
equality in (11.31) . pH = (Q/P)(h/e 2 ), we assumed the fractional charge, 

(11.32) e = Q’ 

following Laughlin [18, 19] and Haldane [34]. Equation ( 11.251 along with <41.291 and (11.301 yield 
that T c decreases as P _1/2 with increasing P. Jain’s hierarchy of fractionals can be obtained by 


examining the fractional around 1/2, using til.291 and (11.30) . 


11.4 Discussion 

Jain’s unification scheme suggests that the fermion nature of electrons and c-fermions must be 
considered in the formulation of the c-bosons, whose condensation was thought to generate the QHE. 
We have incorporated this feature in our c-boson formation model, where the lowest P LLs are filled 
by c-fermions with Q - 1 fluxons without the phonon exchange, while c-bosons with Q fluxons are 
formed with it. In the course of the derivation of (11.31) . we found (11.29) . which allows us to 
discuss the stability of the QHE states quantitatively. For example, the plateau width is almost the 
same at v = 3/7 and 3/5 in Figure 11.4 . The small difference can arise from the v F difference due to 

the effective mass difference associated with the parental c-fermions with Q = 6 and 4. Equations 
(11.6) and (11.19) indicate that both T c and s g decrease as P -1/2 with increasing P. This trend is in 

agreement with all of the plateaus and dips identified in Figure 11.4 . Equation (11.6) also means that 
no QHE is realized for very large P since T c becomes less than the observation temperature. We 

predict that many features appearing in Figure 11.4 are greatly reduced at the extremely low 
temperatures, say 10 mK, so that the only remaining features are the broad plateaus in p H and the 

accompanying zero resistivity at v = 1/Q, odd Q separated by the drops in p H and the sharply peaked 
p at v = 1/Q, even Q similar to the behavior observed in Figure 11.4 . 

In summary the QHE state at v = P/Q odd Q is shown to be the superconducting state with an energy 

































gap generated by the condensed c-bosons, each with Q fluxons carrying the fractional charge e/Q. In 
the present theory the effective field B is defined in the form due to Jain [14-17]: 

1 h 

B = B — B v = B -n e — , n e = electron density 

(11.33) v v e 

with v being fermionic (bosonic) fractions P/Q, even (odd) Q. This means that the c-particles move 
field-free at the exact fraction. For odd Q the c-bosons, if condensed, can move undisturbed due to the 
Meissner effect under a small effective Note added in proof In this Chapter 11 the fractional quantum 
Hall effect is treated by using Laughlins theory and results about the fractional charges carried by 
composite bosons. It is found that the quantum statistical theory in terms of composite particles is 
more comprehensive. All phenomena about the fractional quantum Hall effect can be described within 
the frame work of our quantum statistical theory as given in our paper [35]. 
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Chapter 12 


Quantum Hall Effect in Graphene 


The unusual Quantum Hall Effect (QHE) in graphene is often discussed in terms of Dirac fermions 
moving with a linear dispersion relation. The same phenomenon will be explained in terms of the 
more traditional composite bosons, which move with a linear dispersion relation. The “electron” 
(wavepacket) moves more easily in the direction [llOc-axis] = [110] of the honeycomb lattice than 
perpendicular to it, while the “hole” moves more easily in [001]. Since “electrons” and “holes” move 
in different channels, the number densities can be high especially when the Fermi surface has “necks.” 
The strong QHE at filling factor v = 2 arises from the phonon-exchange attraction in the neighborhood 
of the “neck” Fermi surfaces. The plateau observed for the Hall conductivity is due to the Bose- 
Einstein condensation of the composite bosons, a pair of c-ferimons, each having one electron and 
two fluxons. 


12.1 Introduction 

Experiments [1-3] indicate that there are two kinds of oscillations for the magnetoresistivity p in 
graphene when plotted as a function of the external magnetic field (magnitude) B. Shubnikov-de Haas 
(SdH) oscillations appear on the low-field side while Quantum Hall Effect (QHE) oscillations 
appear on the high-field side. We present a microscopic theory. We start with the graphene 
honeycomb crystal, construct a two-dimensional Fermi surface for the electron dynamics, develop a 
Bardeen-Cooper-Schrieffer (BCS)-like theory [4] based on the phonon-exchange attraction between 
the electron and the fluxon [5], and describe the QHE [6]. The 2D Landau Levels (LL) generate an 
oscillatory density of states. If multiple oscillations occur within the drop of the Fermi distribution 
function, then the SdH oscillation emerges for the magnetoconductivity. This means that the carriers in 
the SdH oscillations must be fermions. The QHE arises from the condensed composite bosons. 

In 2005 Novoselov et al. [1] discovered a QHE in graphene. The gate field effects in graphene are 
reproduced in Figure 12.1 after [1, Figure 1]. The conductivity a as a function of gate voltage V g is 

shown in Figure 12.1a while the Hall coefficient R H measured at magnetic field B = 2T is shown in 
Figure 12.1b . where R H = 1/ne, with n being the conduction electron density, is inverted to show the 
n-linear dependence: 

Figure 12.1 The gate field effects in graphene. Graphene’s conductivity in (a) a (a) and Hall 
coefficient R H (b) as a function of gate voltage V g . RH was measured in magnetic fields B of 2T. R H 

= 1/ne is inverted to emphasize the linear dependence n y The Hall coefficient changes its sign 
near V g = 0. 

After [1, Figure 1]. 







The 1 /R h diverges near V g = 0. The linear relation fit (12.1) yields 


(l 2 . 2) ” = ( 7 - Jx 10 l °cm“ 2 V- 1 )V g . 

The conductivity a(a) rises linearly on both sides at high V g : 

(l 2 3)<7°£V g , V B > 10V . 

This behavior is similar to the case of a metallic single-wall nanotube (SWNT), see Figure 8.8 
(inset), where the o-V curve for a metallic SWNT is shown. Graphene and SWNTs have the same 
configuration but they have different conformations; they have unrolled and rolled graphene sheets. 
The gate voltage generates the mobile surface charges (“electrons,” “holes”). Upon application of a 
bias voltage, the “holes” will move and generate additional charge currents: 

(12.4) J “holes" — cnVf 

with v F being the Fermi speed. The mobility defined by 

o 

(12.5) ne 

reaches 15 OOOcnf^V^s -1 in the experiments. This p-behavior is observed independent of 
temperature T between 10 and 100 K. 

We offer a physical explanation of the a-V g curve in Figure 12.1a . 

The currents near the origin are due to the supercurrents given by (see (8.671 ) 

2 

m 7 su P er — ~^ en o ( v i — ^ 2 ) , 

(12.6) JT 

where is the condensed c-boson density, and Vj (v 2 ) are Fermi speeds of the “electron” (“hole”). 
The system is in a superconducting state in the experimental temperature range: 10-100 K. Thus, the 
curve should be temperature-independent. It is very important to find the superconducting temperature 
T c for the system. We predict that the critical temperature T c is essentially the same for both graphene 











and metallic SWNT. The c-V g curve appears to have a perfect right-left symmetry. This indicates that 

the carriers are only “holes”, and not “holes” and “electrons.” Only “holes” can move along the 
surface boundary as explained earlier. This can be checked by Hall effect measurements. 

Figure 12.2 is reproduced after [1, Figure 2]. The magnetoresistivity p^ at T = 10 K and V g = -60 
V is plotted as a function of magnetic field B(T), exhibiting SdH oscillations. The pxx-B curve is 

remarkably similar to that in Figure 10.5 . the lowest curve, where the SdH oscillations in the 
GaAs/AlGaAs heterojunction are shown. We can interpret the data in the same manner. 

Figure 12.2 The SdH oscillations in graphene at gate voltage V g = -60Y temperature 
T = 10 K. The longitudinal resistivity p^ (kO) is plotted as a function of magnetic field B(T). 

After [1, Figure 2], 


B(T) 



The 2D SdH oscillations have the following features: 

(a) The absence of the Landau diamagnetism This appears as a flat background. 

(b) The oscillation period is s F (ho) c ), where (D c = eB/m*, m* = cyclotron mass, is the cyclotron 
frequency. 

(c) The envelope of the oscillations decreases like [sinh(27t 2 M*k B T/(h B ))] -1 , where M* is the 
magnetotransport mass distinct from the cyclotron mass m*. 

The SdH oscillations in graphene are temperature-dependent. The features evident at 10 K nearly 
disappear at 140 K in the experiments (not shown). 

Figure 12.3 is reproduced after [1, Figure 4]. The longitudinal magnetoresistivity p^ and the Hall 
conductivity a xy in graphene at B = 14 T and T = 4 K are plotted as a function of the conduction 
electron density n (~ 10 12 cm -2 ). 


Figure 12.3 QHE in graphene. 

After Novoselov et al. [1]. 
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The plateau values of the Hall conductivity a xy are quantized in the units of 

Ac 1 

(12.7) T" 

within experimental errors. The longitudinal resistivity p^ reaches zero at the middle of the plateaus. 

These two are the major signatures of the QHE. The strengths of the (superconducting) states decrease 
with increasing filling factor v, see (11.29) . as observed in this figure. We will further discuss the 
features in Figure 12.3 later. 

In 2007 Novoselov et al. [4] reported the discovery of a room-temperature QHE. We reproduced 
their data in Figure 12.4 after [7, Figure 1]. The Hall resistivity p xy for “electrons” and “holes” 

indicates precise quantization within experimental errors in units of h/e 2 at magnetic field 45 T and 
temperature 300 K. This is an extraordinary jump in the observation temperatures since the QHE in 
the GaAs/AlGaAs heterojunction was reported below 1 K. Figure 12.4 is similar to Figure 12.3 
although the abscissas are different, one in gate voltage and the other in carrier density, and hence the 
physical conditions are different. We give an explanation below. 

Figure 12.4 Room-temperature QHE in graphene after Novoselov et al. [7]. Hall conductivity o xy 
(e 2 /h) (dark) and resistance p xx (light) as a function of gate voltage (V g ) at temperature 300 K and 
magnetic field 29 T. Positive values of V g induce “electrons,” and negative values of V g induce 
“holes,” in concentrations n= (7.2 x lO^cnf^WQVg. 
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The QHE behavior observed for graphene is remarkably similar to that for the GaAs/AlGaAs. The 
physical conditions are different, however, since the gate voltage and the applied magnetic field are 
varied in the experiments. The present authors regard the QHE in GaAs/AlGaAs as the 
superconductivity induced by the magnetic field. Briefly, the magnetoresistivity for a QHE system 
reaches zero (superconducting) and the accompanying Hall resistivity reveals a plateau (Meissner 
effect). The QHE state is not easy to destroy because of the superconductivity energy gap in the 
composite boson (c-boson) excitation spectrum If an extra magnetic field is applied to the system at 
optimum QHE state (the center of the plateau), then the system remains in the same superconducting 
state by expelling the extra field. If the field is reduced, then the system stays in the same state by 
sucking in extra field fluxes, thus generating a Hall resistivity plateau. In the graphene experiments, 
the gate voltage applied perpendicular to the plane is varied. A little extra voltage relative to the gate 
voltage at the center of zero resistivity polarizes the system without changing the superconducting 
state. Hence, the system remains in the same superconducting state, keeping zero resistivity and 
constant (flat) Hall resistivity. This state has an extra electric field energy: 

( 12 . 8 ) 7^ A£ ^ 2 ' 

where A is the sample area, s 0 the dielectric constant, and AE is the extra electric field, positive or 

negative, generated by the sample charge. If the gate voltage is further increased (or decreased), then 
it will eventually destroy the superconducting state, and the resistivity will rise from zero. This 
explains the flat p xy plateau and the rise in resistivity from zero. 

The original authors found that the quantization in o xy is exact within experimental accuracy (0.2%) 
as shown in Figure 12.5 which is reproduced after [7, Figure lc], where the Hall resistance R xy at 
45T and 300 K is plotted as a function of the electron density. The quantization in R xy appears at 
h/(2e 2 ), which is a little strange since the most visible quantization for GaAs/AlGaAs appears at h/e 2 . 

Figure 12.5 Hail resistivity R xy (h/e 2 ) for “electrons” (dark circles) and “holes” (light circles) shows 
the exact quantization within experimental errors at 45 T and 300 K, after Novoselov et al. [7]. 
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From the QHE behaviors in Figures 12.3 and 12.5 . we observe that the quantization occurs at a set 
of points: 

h (2P + 1) 

— --- p = 012 

02,91 e 2 2 ' ’ f 

Let us first consider the case: P = 0. The QHE requires a Bose-Einstein condensation (BEC) of c- 
bosons. Its favorable environment is near the van Hove singularities, where the Fermi surface 
changes its curvature sign. For graphene, this happens when the 2D Fermi surface just touches the 
Brillouin zone boundary and “electrons” or “holes” are abundantly generated. Following our recent 
work [8], we shall explain the quantization rule given by 02.91 with the assumption that the c-bosons 
are formed from a pair of like charge c-fermions, each containing a conduction electron and two 
fluxons. 

We postulate that each and every c-fermionhas the effective charge e: 

(12.10) e * — 6 f° r any c-fermion. 

After studying the weak-field fermionic QH state we obtain 

(i2.n) n l 0 = ^/e 

for the density of the c-fermions with Q fluxons, where is the electron density. The points of QHE 
lie on the straight line when p H plotted as a function of the magnetic field as seen in Figure 12.4 . 

Physically when Q is high, the LL separation is great and the c-fermion formation is more difficult. 
The c-boson contains two c-fermions. We calculate the Hall conductivity o H and obtain 
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(12.12) &o h 

where Dq is the c-boson density and n^ the fluxon density at v = 1/2. 

The QHE states with integers P = 1,2, ••• are generated on the weaker field side. Their strength 
decreases with increasing P. Thus, we have obtained the rule (12.9) within the framework of the 
traditional fractional QHE theory [4]. 

In summary, we have successfully described the QHE in graphene without introducing Dirac 
fermions. In solid state physics we deal with “electrons” and “holes”, which move in crystals and 
respond to the Lorentz force. These charged particles are considered as Bloch wave packets having 
size and charges (not a point particle). The bare point-like Dirac particle, if it exists, would be 
dressed with charge cloud, and it would then acquire a mass. It is very difficult to theoretically argue 
that Dirac fermions appear only in graphene but nowhere. The relativistic Dirac electron moves at the 
speed of light, c. The observed particle in graphene moves at a speed of the order of 10 8 ms -1 , which 
is much lower than the speed of light c = 3 x 10 8 ms -1 . It is difficult to explain this difference of two 
orders of magnitude from first principles. The Dirac fermion model is inherently connected to the 
Wigner-Seitz cell model, which is rejected in favor of the rectangular unit cell model in this book. 
The plateau observed for the Hall conductivity o H is caused by the condensed c-bosons. The plateau 

behavior arises from the superconducting state and hence it is unlikely to be explained based on the 
Dirac fermion model. 
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Chapter 13 


Seebeck Coefficient in Multiwalled Carbon 

Nanotubes 


Strictly speaking, the Seebeck coefficient, S, also called the thermoelectric power, is not an electrical 
transport property. But as we see below, the measurement of Seebeck coefficients gives important 
information about the carriers in electrical transport. Because of this we discuss the Seebeck 
coefficient in this chapter. 

On the basis of the idea that different temperatures generate different carrier densities and the 
resulting carrier diffusion generates a thermal electromotive force (emf), a new formula for the 
Seebeck coefficient (thermopower) S is obtained: 
s _ 2 In 2 Cf^b'Pq 
3 qn 

where k B is the Boltzmann constant, q, n, s F , and vq are charge, carrier density, Fermi energy, and the 
density of states at s F , respectively. Ohmic and Seebeck currents are fundamentally different in nature, 

and hence, cause significantly different transport behaviors. For example, the Seebeck coefficient S in 
copper (Cu) is positive, while the Hall coefficient is negative. In general, the Einstein relation 
between the conductivity and the diffusion coefficient does not hold for a multicarrier metal. 
Multiwalled carbon nanotubes are superconductors. The Seebeck coefficient S in multiwalled carbon 
nanotubes is shown to be proportional to the temperature T above the superconducting temperature T c 


based on the model of Cooper pairs as carriers. The S below T c follows a temperature behavior: 



where T' is constant at the lowest temperatures. 


13.1 Introduction 

In 2003 Kang et al. [1] observed a logarithmic temperature (T)-dependence for the Seebeck 
coefficient S in multiwalled carbon nanotubes (MWNTs) at low temperatures. Their data are 
reproduced in Figure 13.1 after [1, Figure 2], where S/T are plotted on a logarithmic temperature 
scale. Above 20 K S is proportional to T: 

Figure 13.1 Low-temperature Seebeck coefficient S of MWNTs plotted as S/T on a logarithmic 
temperature scale 

(reproduced from [1, Figure 2]). 






Below 20 K the curves follow a logarithmic behavior: 

(13.2) S ~ Tin T , 7<20K. 

The data are shown for three samples with different doping levels: A, B, and C. If a system of free 
electrons with a uniform distribution of impurities is considered, then the Seebeck coefficient, also 
called the thermoelectric power, S is temperature-independent which will be shown in Section 13.6. 
Hence, the behavior of T in both fl3.1I and f!3.21 is unusual. If the Cooper pairs (pairons) [2] are 
charge carriers and other conditions are met, then both fl3.ll and f!3.2f are explained 
microscopically, which is shown in this chapter. 

The extended data up to 300 K obtained by Kang et al. [1] are shown in Figure 13.2 . after [1, 
Figure 1]. In Figure 13.2a the S of MWNTs is shown, indicating a clear suppression of S from 
linearity below 20 K as seen in the lower-right inset. In Figure 13.2b . the Seebeck coefficient S of 
highly oriented single-crystal pyrolytic graphite (HOPG) is shown. This S is negative (“electron”- 
like) at low temperatures and become positive (“hole”-like) and constant above 150 K: 


Figure 13.2 (a) The temperature dependence of thermoelectric power of MWNTs at several doping 
levels. The suppression of TEP from linearity at low temperatures is clearly shown in the lower-right 
inset (the line represents a linear T dependence), (b) The thermoelectric power of single-crystal 
HOPG and glassy carbons. No suppression can be recognized for both as T —► 0 

(reproduced from [1, Figure 1]). 















The “electron” (“hole”) is a quasielectron which has an energy higher (lower) than the Fermi 
energy and which circulates counterclockwise (clockwise) when viewed from the tip of the applied 
magnetic field vector. “Electrons” (“holes”) are excited on the positive (negative) side of the Fermi 
surface with the convention that the positive normal vector at the surface points in the energy- 
increasing direction. Graphite is composed of ABAB-type graphene layers. The different T-behaviors 
for graphite (3D) and MWNTs (2D) should arise from the different carriers. We will show that the 
majority carriers in graphene and graphite are “electrons” while the majority carriers in MWNT are 
“holes” based on the rectangular unit cell model, which is shown in Sections 13.5 and 13.6. Note that 
conduction electrons are denoted by quotation marked “electrons” (“holes”) whereas generic 
electrons are denoted without quotation marks. 




13.2 Classical Theory of the Seebeck 
Coefficient in a Metal 

We now take a system of free electrons with mass m and charge —e with a uniform distribution of 
inpurities which act as scatterers. We assume that a free classical electron system in equilibrium is 
characterized by the ideal gas condition so that the average electron energy s depends on the 
temperature T only: 

(13.4) f = r(T), 

where n is the electron density. The electric current density j is given by 
(13.51 j = (—e)(iK , 

where v is the average velocity. We assume that the density n is constant in space and time. If there is 
a temperature gradient, then there will be a current as shown below. We assume first a one¬ 
dimensional (ID) motion. The velocity field v depends on the temperature T, which varies in space. 
Assume that the temperature T is higher at x + Ax than at x: 

(13.6) T (* + Ax ) > T ( x )* 

Then 


v[n, T(x + A#)] — T(?c)] — 


3v(n,T) dT 


Ax , 


(13.7) L ' ' n 1 ' " 3T 

The diffusion and heat conduction occur locally. We may choose Ax to be a mean free path: 

H3.8) J = vt , 

which is constant in our system Then the current j is, from (13.5) . 

(13.9) j * ^ n 3T dx * 

When a metallic bar is subjected to a voltage (V) or temperature (T) difference, an electric current 
is generated. For small voltage and temperature gradients we may assume a linear relation between 
the electric current density j and the gradients: 

(13.10) j = <>(-V V) + A(-VT) = oE - AVT , 

where E = - VV is the electric field and a the conductivity. If the ends of the conducting bar are 
maintained at different temperatures, no electric current flows. Thus, from 113.101 . we obtain 

(13.11) tfEs-AVT = 0, 

where E s is the field generated by the Seebeck electromotive force (emf). The Seebeck coefficient S 
is defined through 


Es - SVT , 


A 

S s — 
cr 


n 3.12i 

The conductivity a is positive, but the Seebeck coefficient S can be positive or negative. The 
measured Seebeck coefficient S in A1 at high temperatures (400-670°C) is negative, while the S in 
noble metals (Copper (Cu), silver (Ag), and gold (Au)) are positive as shown in Figure 13.3 . 










Figure 13.3 High-temperature Seebeck coefficients above 400°C for Ag, Al, Au, and Cu. The solid 
and dashed lines represent two experimental data sets. Taken from [3]. The discontinuities arise from 
melting. 
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On the basis of the classical idea that different temperatures generate different electron drift 
velocities, we obtain the Seebeck coefficient (Problem 13.3.1): 

S = 

113.131 ine ' 

where cy is the heat capacity per unit volume and n the electron density. Using 113.81-113.101 . we 
obtain 

Bv 

A = (— e)n — vi . 

113.141 v ' 3T 

The conductivity a is given by the Drude formula: 


ne 2 z 


a = 


113.151 m 
Thus, the Seebeck coefficient S is, using 113.141 and 113.151 . 

1 3v 2 


s = ± = 

a 


(13.16) 
where 

_ 

(13.17) C ~ ~BT 


1 3v / 

— m - 

ne 3T i 

1 3 1 


•ni¬ 


ne BT 2 


tnv = 


ne 3T 
1 3c 

ne ST 


1 


ne 


■c > 


is the heat capacity per electron. 

Our theory can be extended simply for 3D motion. The equipartition theorem holds for the classical 
electrons: 




(13.18) 

where the angular brackets mean the equilibrium averages. Hence, the average energy is 









































(13.19) 
We obtain 


s = Jtnv 2 = i {yl + v] + vj) = jk B T . 


1 Sv 1 

A — —en —-—r , 
(13.20) 2 St 


Using this, we obtain the Seebeck coefficient for 3D motion as 

c v fee 

3 ne 2e 

where 


S = — = 
(13.21) a 


_ 3c 3^ 

(13.22) CV = Jf~ 2 B 

is the heat capacity per electron. The heat capacity per unit volume, c^, is related to the heat capacity 
per electron, c, by 


(13.23) cy = nc . 


Setting Cy equal to 3nk B /2 in U 3.131 . we obtain the classical formula for S: 


■^classical = = -0.43 X 10 -4 VK -1 = -43 uVK" 1 . 

(13.24) dassical 2e M 

Observed Seebeck coefficients in metals at room temperature are of the order of microvolts per 
degree (see Figure 13.3 1. a factor of ten smaller than S c]assical . If we introduce the heat capacity 

computed using Fermi statistics [4] (Problem 13.3.2) 


1 2 , 1 2 f T 

cy - —n nk B -= -n nk B — , 

03.251 2 c p 2 Tp 

where T F (e F ) is the Fermi temperature (energy), we obtain 


r n k E k E T 

^ semiquantum — 7 j 

03.261 6 e 

which is often quoted in materials handbook [3]. Formula 03.261 remedies the difficulty with respect 
to the magnitude. But the correct theory must explain the two possible signs of S besides the 
magnitude. Fujita, Ho, and Okamura [5] developed a quantum theory of the Seebeck coefficient. We 
follow this theory below. 


13.3 Quantum Theory of the Seebeck 
Coefficient in a Metal 

Let us recall that “electrons” (“holes”) are thermally excited above (below) the Fermi energy. This 
means that the existence of “holes” by itself is a quantum effect. 

We assume that the carriers are conduction electrons (“electron”, “hole”) with charge q (—e for 
“electrons,” +e for “holes”) and effective mass m*. Assuming a one-component system, the Drude 
conductivity a is given by 











H 3.271 m* 

where n is the carrier density and x the mean free time. We observe from H3.271 that a is always 
positive irrespective of whether q = —e or +e. The Fermi distribution function f is 

(13 V (e;M = ^TT’ 

where p is the chemical potential whose value at 0 K equals the Fermi energy s F . The voltage 

difference AV = LE, with L being the sample length, generates the chemical potential difference Ap, 
the change in f, and consequently, the electric current. Similarly, the temperature difference AT 
generates the change in f and the current. 

At 0 K the Fermi surface is sharp and there are no conduction electrons (“electrons,” “holes”). At a 
finite T, “electrons” (“holes”) are thermally excited near the Fermi surface if the curvature of the 
surface is negative (positive), see Figures 13.4 and 13.5 . We assume a high Fermi degeneracy: 

fl 3.291 T f » T . 

Figure 13.4 (a,b) More “electrons” (dots) are excited above the Fermi surface (solid line) at the 
high-temperature end: T 2 (> T t ). The shaded area denotes the electron-filled states. “Electrons” 

diffuse from (b) to (a). 




Figure 13.5 (a,b) More “holes” (open circles) are excited below the Fermi surface at the high- 
temperature end: T 2 (> T,). “Holes” diffuse from(b) to (a). 

71 < T 2 




Consider first the case of “electrons.” The number of thermally excited “electrons,” N x , having 
energies greater than the Fermi energy e F is defined and calculated as (Problem 13.3.3) 












CO CO 

N x = f df'D(f’)—r —— -~ £>o / df—r—- 

£ f fr 

imoi = - p ^~ l ['" i 1 + = ln2t ° m - 

where £>0 = d(s f ) is the density of states at s = e F . The excited “electron” density n = N x /y, where y 

is the sample volume, is higher at the high-temperature end, and the particle current runs from the 
high- to the low-temperature end. This means that the electric current runs towards (away from) the 
high-temperature end in an “electron” (“hole”)-rich material. After using formula (13.121 . we find 
S < 0 for "electrons" 

(13.31) S > 0 for "holes” . 

The Seebeck current arises from the thermal diffusion. We assume Fick’s law: 


(13.32) j = 9Jparticte =. 

where D is the diffusion constant, which is computed from the kinetic-theoretical formula: 


D — —Vl= — vlx 


V — V F 


1 — VT 


(1333) d d 

where d is the dimension in this chapter. The density gradient Vn is generated by the temperature 

gradient VT, and is given by 

_ In 2 T >o 

Vn= - Ib^oVT, Dq = —- , 

(13.34) d V 

where n3.30f is used. Using the last three equations and (1330), we obtain 
(1335) A = ln2qijk E V 0 T . 

Using (1330), (1337), and (1335), we obtain (Problem 13.3.4) 


_ A 

(13.36) o 


2 In 2 


(-) 

V qn } 




The mean free time x cancels out from the numerator and denominator. 

The derivation of our formula U3.36f for the Seebeck coefficient S was based on the idea that the 
Seebeck emf arises from the thermal diffusion. We used the high Fermi degeneracy condition (13.291 : 
T f » T. The relative errors due to this approximation and due to the neglect of the T-dependence of g 

are both of the order (k B T/s F ) 2 . Formula (13361 can be negative or positive, while the materials 

handbook formula (13.261 has a negative sign. The average speed v for highly degenerate electrons is 
equal to the Fermi velocity v F (independent of T). In Ashcroft and Mermin’s book [4], the origin of a 


positive S in terms of a mass tensor M = {mjj} is discussed. This tensor M is real and symmetric, and 
hence, it can be characterized by the principal masses {n^}. The formula for S obtained by Ashcroft 

and Mermin [4, Equation (13.621 ]. can be positive or negative but is hard to apply in practice. In 
contrast our formula (13361 can be applied straightforwardly. Besides our formula for a one-carrier 
system is T-independent, while Ashcroft and Mermin’s formula is linear in T. 

Formula (13361 is remarkably similar to the standard formula for the Hall coefficient of a one- 
component system: 






















(13.37) r h = (qn) 1 • 

Both Seebeck and Hall coefficients are inversely proportional to charge q, and hence, they give 
important information about the sign of the carrier charge. In fact the measurement of the S of a 
semiconductor can be used to see if the conductor is n-type or p-type (with no magnetic 
measurements). If only one kind of carrier exists in a conductor, then the Seebeck and Hall 
coefficients must have the same sign as observed in alkali metals. 

Let us consider the electric current caused by a voltage difference. The current is generated by the 
electric force that acts on all electrons. The electron’s response depends on its mass m*. The density 
(n) dependence of a can be understood by examining the current-carrying steady state in Figure 13.6b . 
The electric field E displaces the electron distribution by a small amount h -1 q Ex from the 
equilibrium distribution in Figure 13.6a . Since all the conduction electrons are displaced, the 
conductivity a depends on the particle density n. The Seebeck current is caused by the density 
difference in the thermally excited electrons near the Fermi surface, and hence, the thermal diffusion 
coefficient A depends on the density of states at the Fermi energy, x>o (see (13.35 ) 1. We further note 

that the diffusion coefficient D does not depend on m directly (see (13.33B . Thus, the Ohmic and 
Seebeck currents are fundamentally different in nature. 

Figure 13.6 As the electric field E points in the positive x-direction, the steady-state electron 
distribution in (b) is generated by a translation of the equilibrium distribution in (a) by the amount 8k 
= -eE x/h. 




For a single-carrier metal such as sodium which forms a body-centered cubic (bcc) lattice, where 
only “electrons” exist, both R H and S are negative. The Einstein relation between the conductivity a 

and the diffusion coefficient D holds: 

(13.381 a oc D , 

Using (13.271 and (13.331 . we obtain 
D VpX/3 2 £ F 

39 ^ a nq 2 T/m* 3 nq 2 

which is a material constant. The Einstein relation is valid for a single-carrier system. 

The relation does not hold in general for multicarrier systems. The ratio D/a for a two-carrier 
system containing “electrons” (1) and “holes” (2) is given by (Problem 13.3.5) 






















D _ l/3u 1 2 r 1 + 1 /3 u 2 t 2 

(13.40f a + ( n 2<lll™5)*2 ' 

which is a complicated function of (m/Viii/), (ni/n 2 ), C^Afy), and (X|/x 2 ). hi particular the mass ratio 
m*lm£ may vary significantly for a heavy fermion condition, which occurs whenever the Fermi 


surface just touches the Brillouin boundary, see below. An experimental check on the violation of the 
Einstein relation can be carried out by simply examining the T dependence of the ratio D/a. This ratio 
from (13.39f is constant for a single-carrier system, while from fl3.40f it depends on T since the 
generally T-dependent mean free times (x 1? x 2 ) arising from the electron-phonon scattering do not 

cancel out from numerator and denominator. Conversely, if the Einstein relation holds for a metal, the 
spherical Fermi surface approximation with a single effective mass m* is valid for this single-carrier 
metal. 

Problem 13.3.1. Derive (13.13f using kinetic theory. 

Problem 13.3.2. Derive fl3.25f applying the Fermi statistics for free carriers. 

Problem 13.3.3. Verify H 3.30V Hint: 


4- Ln(l + = 

dr 


X _|_ e -^ (£—.«) 


Problem 13.3.4. Verify H 3.36V 
Problem 13.3.5. Derive (13.40) . 


13.4 Simple Applications 

We consider two-carrier metals (noble metals). Noble metals including Cu, Ag, and Au form face- 
centered cubic (fee) lattices. Each metal contains “electrons” and “holes.” The Seebeck coefficient S 
for these metals are shown in Figure 13.3 . The S is positive for all: 

( 13 . 41 ) S > 0 forCu,Au,Ag, 

indicating that the major carriers are “holes.” The Hall coefficient R H is known to be negative: 

(13.42) Rh < 0 for Cu, Au, Ag . 

Clearly the Einstein relation (13.38f does not hold since the charge sign is different for S and R H . 

This complication was explained by Fujita, Ho, and Okamura [5] based on the Fermi surfaces having 
“necks” (see Figure 13.7 V The curvatures along the axes of each neck are positive, and hence, the 
Fermi surface is “hole”-generating. Experiments [6-8] indicate that the minimum neck area A m 

(neck) in the k-space is 1/51 of the maximum belly area A in (belly), meaning that the Fermi surface 

just touches the Brillouin boundary ( Figure 13.7 exaggerates the neck area). The density of “hole”- 
like states, n hole , associated with the {111) necks, having a heavy fermion character due to the rapidly 

varying Fermi surface with energy, is much greater than that of “electron”-like states, n electron , 
associated with the {100) belly. The thermally excited “hole” density is higher than the “electron” 
density, yielding a positive S. The principal mass n^* along the axis of a small neck (mj* 1 = d 2 e/ 
Vpj 2 ) is positive (“hole”-like) and extremely large. The contribution of the “hole” to the conduction is 
















^ 1 

small (a oc m _1 ). Then the “electrons” associated with the nonneck Fermi surface dominate and yield 
a negative Hall coefficient R H . 


Figure 13.7 The Fermi surface of silver (fee) has “necks,” with the axes in the (111) direction, located 
near the Brillouin boundary, reproduced after [6-8]. 



We note that the Einstein relation (T3.381 does not hold in general for multicarrier systems. If the 
Einstein relation holds for a metal, the spherical Fermi surface approximation with a single effective 
mass m is valid. 

13.5 Graphene and Carbon Nanotubes 

Graphite and diamond are both made of carbons. They have different lattice structures and different 
properties. Diamond is brilliant and it is an insulator while graphite is black and is a good conductor. 
In 1991 Iijima [9] discovered carbon nanotubes in the soot created in an electric discharge between 
two carbon electrodes. These nanotubes ranging from 4 to 30 nm in diameter were found to have a 
helical multiwalled structure. The tube length is about one micron (pm). Single-wall nanotubes 
(SWNTs) were fabricated first by Iijima and Ichihashi [10] and by Bethune et al. [11] in 1993. The 
tube size is about 1 nm in diameter and a few microns in length. The scroll-type tube is called a 
multiwalled carbon nanotube (MWNT). The tube size is about 10 nm in diameter and a few microns 
(pm) in length. An unrolled carbon sheet is called graphene, which has a honeycomb lattice structure 
as shown in Figure 13.8 . 

Figure 13.8 A honeycomb lattice and the Cartesian (rectangular) unit cell (dotted line) of graphene. 
The rectangular unit cell contains four C + ions represented by open circles (©). 
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We consider graphene which forms a 2D honeycomb lattice. The normal carriers in the transport of 
electrical charge are “electrons” and “holes.” Following Ashcroft and Mermin [4], we assume the 
semiclassical (wavepacket) model of a conduction electron. It is necessary to introduce a k-vector: 

(13.43) ^ — kx^x + k y e y + 

since the k-vector is involved in the semiclassical equation of motion: 
t ; , dJt 

(13.44) “ 5 ‘jF = !(E+ " 8) ’ 

where E and B are the electric and magnetic fields, respectively. The vector 

1 3c 

(13.45) V = ~hdk 

is the particle velocity, where s is the particle energy. For some crystals such as simple cubic, face- 
centered cubic, body-centered cubic, tetragonal, and orthorhombic crystals, the choice of the 
orthogonal (x, y, z)-axes and the unit cells are obvious. Two-dimensional crystals such as graphene 
can also be treated similarly, with only the z-component being dropped. We will show that graphene 
has “electrons” and “holes” based on the rectangular unit cell model. 

We assume that the “electron” (“hole”) wavepacket has a charge —e (+e) and the size of a unit 
carbon hexagon, generated above (below) the Fermi energy s F . We will show that (a) the “electron” 

and “hole” have different charge distributions and different effective masses, (b) that the “electrons” 
and “holes” are thermally activated with different energy gaps (e 1? e 2 ), and (c) that the “electrons” 

and “holes” move in different easy channels along which they travel. 

The positively charged “hole” tends to stay away from the positive C + ions, and hence its charge is 
concentrated at the center of the hexagon. The negatively charged electron tends to stay close to the C + 
hexagon and its charge is therefore concentrated near the C + hexagon. In our model, the “electron” and 
“hole” both have sizes and charge distributions, and they are not point particles. Hence, their masses 
ni| (“electron”) and m 2 (“hole”) must be different from the gravitational mass m = 9.11 x 10 -28 g. 

Because of the different internal charge distributions, the “electrons” and “holes” have different 
effective masses m 2 and m 2 . The “electron” may move easily with a smaller effective mass in the 

direction [110 c-axis] = [110] than perpendicular to it as we see presently. Here, we use the 
conventional Miller indices for the hexagonal lattice with omission of the c-axis index. For the 














description of the electron in terms of the mass tensor, it is necessary to introduce Cartesian 
coordinates, which do not match with the crystal’s natural (triangular) axes. We may choose the unit 
cell as shown in Figure 13.8 . Then the Brillouin zone boundary in the k space is a rectangle with side 
lengths (2n/b, 27t/c). The “electron” (wavepacket) may move up or down in [110] to the neighboring 
hexagon sites passing over one C + . The positively charged C + acts as a welcoming (favorable) 
potential valley center for the negatively charged “electron,” while the same C + acts as a hindering 
potential hill for the positively charged “hole.” The “hole” can, however, move easily horizontally 
without ever meeting the hindering potential hills. Then, the easy channel directions for the 
“electrons” and “holes” are [110] and [001], respectively. 

The thermally activated electron densities are then given by [13] 

(13.46) n i( T ) , 

where j = 1 and 2 represent the “electron” and “hole,” respectively. The prefactor nj is the density at 
the high-temperature limit. 


13.6 Conduction in Multiwalled Carbon 
Nanotubes 


MWNTs are open-ended. Hence, each pitch is likely to contain an irrational number of carbon 
hexagons. Then, the electrical conduction of MWNTs is similar to that of metallic SWNTs [14]. 

Phonons are excited based on the same Cartesian unit cell as the conduction electrons in the carbon 
wall. The phonon-exchange interaction binds Cooper pairs, also called pairons [2]. 

The conductivity a based on the pairon carrier model is calculated as follows. The pairons move in 
2D with the linear dispersion relation [2]: 


(43.47! 

(13.48) 


£p = C^-^p , 

c iJ > = — 




where v F ® is the Fermi velocity of the “electron” (] = 1) (“hole” (j =2)). 


Consider first “electron”-pairs. The velocity v is given by (omitting superscript) 


di\ 


v = 


or Uv — 


Bp 


= c 


Px 


(43.491 Bp dp dp x p 

where we used 113.471 for the pairon energy s p and the 2D momentum, 


P = (pl + Pr) 


1/2 


(13.50) 

The equation of motion along the electric field E in the x-direction is 
113.511 Bt 

where q' is the charge ±2e of a pairon. The solution of 113.511 is given by 


= q'E > 









> r + , (0) 

(13.52) Px = q Et + py , 

where p x (U) is the initial momentum component. The current density j p is calculated from (charge q') x 
(number density np) x (average velocity 7 ). The average velocity is calculated by using 03.491 and 

03.521 with the assumption that the pairon is accelerated only for the mean tree time x and the initial- 
momentum-dependent terms are averaged out to zero. We then obtain 

Jp = q n p i> = q n p c— = q n p —Ez . 

(13.53) P P 

For stationary currents, the partial pairon density rip is given by the Bose distribution function f(s p ): 

(13.54) n p = f( f p) Np (P £ p - °) ~ i] -1 > 


where e a is the fiigacity. Integrating the current j p over all 2D p-space, and using Ohm’s law j = oE, 
we obtain for the conductivity o: 


o = ( 2 ^fi) 2 q 2 c I d £ pp L f{e v )t 


03.551 


/ 


-1 


In the low temperatures we may assume the Boltzmann distribution function for f(e p ): 

(13.56) /( £ p) - ®P “ 0 f p) • 

We assume that the relaxation time (inverse collision frequency) arises from the phonon scattering so 
that 


(13.57) r — (aT) 1 f a = constant. 

After performing the p-integration we obtain from 03.551 
l g 2 ^B g 

(13.58) ° ~ n ah 2 € f 

which is temperature-independent. If there are “electron” and “hole” pairons, they contribute 
additively to the conductivity. These pairons should undergo a Bose-Einstein condensation at lowest 
temperatures. 


13.7 Seebeck Coefficient in Multiwalled 
Carbon Nanotubes 

We are now ready to discuss the Seebeck coefficient S of MWNTs. First, we will show that the S is 
proportional to the temperature T above the superconducting temperature T c . 

We start with the standard formula for the charge current density: 

(13.59) J = <1*™ > 

where it is the average velocity, which is a function of temperature T and the particle density n: 

(13.60) * - »( n ’ T ) ■ 

We assume a steady state of the system in which the temperature T varies only in the x-direction while 
the density is kept constant. The temperature gradient dT/dx generates a current (Problem 13.7.1): 








(13.61) J J ar dx 

The thermal diffusion occurs locally. We may choose Ax to be a mean free path: 

113.621 A x = i! — vt . 

The current density, j p , at the 2D pairon momentum p, which is generated by the temperature 

gradient dT/dx, is thus given by 

, , mdT 

f D = q nu^vlnn.T) = q tt D -—-—vt . 

113.631 ip ^ p n p ' 1 ^ P 5T dx 

Integrating 113.631 over all 2D p-space and comparing with 113.101 . we obtain (Problem 13.7.2) 

113.641 A= [ d 2 pv x f(E p )T= (2 nh)~ 2 q'^c j d 2 py/(e p )r . 

We compare this integral with the integral in 113.551 . It has an extra factor in p and therefore 
generates an extra factor T when the Boltzmann distribution function is adopted for f(Sp). Thus, we 

obtain (Problem 13.7.3), using 113.551 and 113.641 . 

A 

S = - oc T . 

113.651 a 

We next consider the system below the superconducting temperature T c . The supercurrents arising 

from the condensed pairons generate no thermal diffusion. But noncondensed pairons can be scattered 
by impurities and phonons, and contribute to a thermal diffusion. Because of the zero-temperature 
energy gap 

(13.66) r s = kfilg , 

generated by the supercondensate, the population of the noncondensed pairons is reduced by the 
Bo 1 tzmann-Arrheni us factor 

(13 . 67) eXP (-i&) =eXP (-T)' 

This reduction applies only for the conductivity (but not for the diffusion). Hence, we obtain the 
Seebeck coefficient (Problem 13.7.4): 

_ A T 

H 3.681 S “ O “ exp(-T g /T) ' 

In the experiment in [1] a MWNT bundle containing hundreds of individual nanotubes was used. Both 
circumference and pitch have distributions. Hence, the energy gap s g (= k B T g ) has a distribution. 

Kang et al. [1] measured the conductance G, which is proportional to the conductivity o, of the 
MWNT samples. Their data are reproduced in Figure 13.9 . after [1, Figure 3], where the conductance 
G as a function of temperature is plotted on a logarithmic scale. The G arising from the conduction 
electron in each MWNT carries an Arrhenius-type exponential 


Figure 13.9 The conductance G of the multiwalled carbon nanotube samples as a function of 
temperature 

(after [1, Figure 3]). 



















£a = 


(13.69) 


exp 



where s a is the activation energy. This energy s a has a distribution since the MWNTs have varied 

circumferences and pitches. The temperature behavior of G for the bundle of MWNTs is seen to be 
represented by 



(13.70) V 1 ) 

in the range: 5-20 K. The electron-activation energy s a and the zero-temperature pairon energy gap s g 
are different from each other. But they have the same orders of magnitude and both are temperature- 


independent. We assume that the distributions are similar. We may then replace exp(-T g /T) in 113.681 
by (T'g/T) 1/2 , obtaining the Seebeck coefficient for a bundle of MWNTs 



or 


(13.72) In Sbundie ^ Tin 7 below 20K , 
which is observed in Figure 13.1 . 

The data in Figure 13.1 clearly indicates a phase change at the temperature 

(13.73) T 0 = 20 K, 

We now discuss the connection between this T 0 and the superconducting temperature T c . We deal with 

a thermal diffusion of the MWNT bundle. The diffusion occurs most effectively for the most 
dissipative samples which correspond to those with the lowest superconducting temperatures. Hence, 
the T 0 observed can be interpreted as the superconducting temperature of the most dissipative 

samples. 

In contrast the conduction is dominated by the least dissipative samples having the highest T c . 

Figure 13.9 shows a clear deviation of G around 120 K from the experimental law: G ~ In T. We may 
interpret this as an indication of the limit of the superconducting states. We then obtain 


(13.74) T c - 120 K 











for the good samples. 

By considering moving pairons we obtained the T-linear behavior of the Seebeck coefficient S 
above the superconducting temperature T c and the T In T-behavior of S at the lowest temperatures. 

The energy gap s g vanishes at T c . Hence, the temperature behaviors should be smooth and monotonic 
as observed in Figure 13.1 . This supports our interpretation of the data based on the superconducting 
phase transition. The doping changes the pairon density and the superconducting temperature. Hence, 
the data for A, B, and C in Figure 13.1 are reasonable. 

Problem 13.7.1. Assume that the temperature T is higher at x + Ax than at x: T(x + Ax) > T(x). Noting 
that the velocity field (average velocity) v depends on the local temperature T(x), show that 

awn f r)3r 

v(T(x + Ax)) - v(T(x)) = -L—L—Ax 

0 1 ox 

holds for v = v(T(x)). Applying this formula, derive (13.61) . 

Problem 13.7.2. Derive (13.64) . 

Problem 13.7.3. Show (13.65) that the Seebeck coefficient is proportional to temperature T. 

Problem 13.7.4. Derive (13.68) . 
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Chapter 14 


Miscellaneous 


14.1 Metal-Insulator Transition in Vanadium 
Dioxide 

Vanadium dioxide (V0 2 ) undergoes a metal-insulator transition (M IT) at 340 K with the structural 

change from a tetragonal (tet) to monoclinic (mcl) crystal as the temperature is lowered. The 
conductivity a drops at MIT by four orders of magnitude. The low-temperature monoclinic phase is 
known to have a lower ground state energy. The existence of a k-vector k is a prerequisite for the 
conduction since the k appears in the semiclassical equation of motion for the conduction electron 
(wavepacket). Each wavepacket is, by assumption, composed of plane waves proceeding in a k 
direction perpendicular to the plane. The tetragonal (V0 2 ) 3 unit cells are periodic along the crystal’s 

x-, y-, and z-axes, and hence there are three-dimensional k-vectors. There are one-dimensional k 
along the c-axis for a monoclinic crystal. We argue that this decrease in the dimensionality of the k- 
vectors is the cause of the conductivity drop. 

14.1.1 Introduction 

In 1959 Morin reported his discovery of a metal-insulator transition in vanadium dioxide (V0 2 ) [1]. 
The compound V0 2 forms a monoclinic crystal on the low-temperature side and a tetragonal crystal 
on the high-temperature side. When heated, V0 2 undergoes an insulator-metal transition around 340 

K, with resistivity drops by four orders of magnitude as T is raised. The phase change carries a 
hysteresis similar to a ferro-paramagnetic phase change. The origin of the phase transition has been 
attributed by some authors to Peierls instability driven by strong electron-phonon interaction [2], or 
to Coulomb repulsion and electron localization due to the electron-electron interaction on a Mott- 
Hubbard picture by other authors [3-5]. 

A simpler view on the MIT is presented here. We assume that the electron wavepacket is composed 
of superposable plane waves characterized by the k-vectors. The superposability is a basic property 
of the Schrodinger wavefimction in free space. A mcl crystal can be generated from an orthorhombic 
crystal (ore) by distorting the rectangular faces perpendicular to the c-axis into parallelograms. 
Material plane waves proceeding along the c-axis exist since the (x, y) planes containing materials 
(atoms) perpendicular to the z-axis are periodic. It then has one-dimensional (ID) k-vectors along the 
c-axis. In the x-y plane there is an oblique net whose comers are occupied by V’s for mcl V0 2 . The 

position vector R of every V can be represented by integers (m, n), if we choose 
04.11 &mn — (ttfli + na 2 , 



where and a 2 are nonorthogonal base vectors. In the field theoretical formulation the field point r 
is given by 

f 14.21 1' — i + t 

where r’ is the point defined within the standard unit cell. Equation 114.21 describes the 2D lattice 
periodicity but does not establish k-space as shown earlier, in Section 5.2, Chapter 5. 

If we omit the kinetic energy term, then we can still use 114.11 and obtain the ground state energy 
(except the zero-point energy). The reduction in the dimensionality of the k-vectors from 3D to ID is 
the cause of the conductivity drop in the MIT. 

The MIT proceeds by domains since the insulator (mcl) phase has the lower degrees of symmetry. 
Strictly speaking, the existence of ID k-vectors allows the mcl material to have a small conductivity. 
This happens for V0 2 , see below. 

Wu et al. [6] measured the resistance R of individual nanowires W x V 1 _ x 0 2 with tungsten (W) 

concentration x ranging up to 1.14%. The nanowires are grown with the wire axis matching the c-axis 
of the high-temperature rutile structure. The transition temperature T 0 decreases from 340 K, passing 

room temperature, to 296 K as the concentration x changes from 0 to 1.14%. The temperature 
dependence of R for the low-temperature phase is semiconductor-like. That is, the resistance R 
decreases with increasing temperature. The behavior can be fitted with the Arrhenius law: 

/ c a 


o oc R 1 oc exp 




114.31 " V ksT 

where s a is the activation energy. The activation energy s a is about 300 meV 
Whittaker et al. [7] measured the resistance R in a V0 2 nanowire and observed that (a) the 

resistance R for the low-temperature (mcl) phase shows an Arrhenius-type dependence as shown in 
114.31 . while (b) the resistance R for the high-temperature (tet) phase appears to be T-independent. 
These different behaviors may arise as follows. The currents in (a) run along the nanowire axis, 
which is also the easy c-axis of the mcl crystal. Hence, Arrhenius behavior for the conductivity is 
observed for case (a). For case (b), the currents run in three dimensions. Only those electrons near the 
Fermi surface are excited and participate in the transport. Then, the density of excited electrons, n x , is 

related to the total electron density n 0 by 

k B T 

n x - cn 0 -, 

(14.4) £a 

where c is a number close to unity. The factor T cancels out with the T-linear phonon scattering rate 
for the conductivity. The T-dependence of the exponential factor is small since the activation energy 
s a is much greater than the observation temperature scale measured in units of 10°C. Thus, the 

conductivity is nearly constant. There is a sudden drop of resistance around 300 K, when the two 
phases separate. 

In summary the MIT in V0 2 directly arises from the lattice structure change between the tet and the 

mcl crystal. The tet (mcl) crystal has 3D (ID) k-vectors. The reduction in the dimensionality of the k- 
vectors is the cause of the conductivity drop. 









14.2 Conduction Electrons in Graphite 

Graphite is composed of graphene layers stacked in the manner ABAB - along the c-axis. We may 
choose a Cartesian unit cell as shown in Figure 14.1 . 

Figure 14.1 The Cartesian unit cell (white solid lines) viewed from the top for graphite. The unit cell 
of graphite has two layers of graphene. The carbons (circles) in the A (B) planes are shown in black 
(gray). 



As Figure 14.1 shows the rectangle (solid line) in the A plane (black) contains six C’s wholly 
within and four C’s at the sides. The side C’s are shared by neighbors. Hence, the total number of C’s 
is 6 x l + 4 x 1/2 = 8 . The rectangle in the B plane (gray) contains five C’s within, four C’s at the 
sides, and four C’s at the corners. The total number of C’s is 5 x 1 + 4 x 1/2 + 4 x 1/4 = 8 . The unit 
cell therefore contains 16 C’s. The two rectangles are stacked vertically with the interlayer 
separation, c 0 = 3.35 A much greater than the nearest-neighbor distance between two C’s, a 0 = 1.42 

A. The unit cell has three side-lengths: 

H4.5f hi = 3a 0 , b 2 = 2V3a 0 , h 3 = 2c 0 . 

The center of the unit cell is empty. Clearly, the system is periodic along the orthogonal directions 
with the three periods (b 1? b 2 , b 3 ) given in (14.5) . We may assume that both the “electron” and “hole” 

have the same unit cell size. Thus, the system is orthorhombic with the sides (b 1? b 2 , b 3 ), bi + b 2 , bj + 

b 3 , b 2 ^ b 3 . 

The negatively charged “electrons” (with charge -e) in graphite are welcomed by the positively 
charged C + when moving in a vertical direction just as in graphene. That is, the easy directions of 
movement for the “electrons” are vertical. The easy directions for the “holes” are horizontal. There 
are no hindering hills for “holes” moving horizontally. Hence, just as for graphene, the “electron” in 
graphite has a lower activation energy s than the “hole”: 

(14.6) *T < c 2 * 

Because of this, the “electrons” are the majority carriers in graphite. 

Graphite and graphene have very different unit cells including dimensionality. Thus, the electrical 
transport behaviors must be significantly different. There are “electrons” and “holes” in graphite. We 








predict that graphite is a superconductor with a superconducting temperature of the order of 1 K. 


14.3 Coronet Fermi Surface in Beryllium 

Divalent beryllium (Be) forms a hexagonal close packed (hep) crystal. The Fermi surface in the 
second zone constructed in the Nearly Free Electron Model (NFEM) is represented by the “monster” 
( Figure 3. Sa t and the actually observed “coronet” ( Figure 3.8b l as shown in Figure 3.8 . The figures 
are drawn based on the Wigner-Seitz model. Part of the coronet Fermi surface can be fitted with the 
quadratic-in-k dispersion relation: 

^ 1 2 1 2 1 

E — Pi F p ? F Pi 

(14.7) 2 mf 2 m 2 F 2 m / 3 

with two negative effective masses and one positive effective mass. The coronet encloses unoccupied 
states. The effective masses (m 1? m 2 , m 3 ) may be directly obtained from the cyclotron resonance data, 

using Shockley’s formula [ 8 ]: 

w j m 2 m 3 cos 2 (/ 2 , #i) + m^mi cos 2 (/f, x 2 ) + m 1 m 2 cos 2 (w, x-$) 

(14 8) eB V m l m 2 m l 

where cos(p, Xj) is the directional cosine between the field direction p and the xj-direction. The 
Cartesian coordinate system (x 1? x 2 , x 3 ) can be chosen along the orthogonal unit cells for the hep 
crystal. It is a challenge to obtain the values of (m b m 2 , m 3 ) after choosing the orthogonal axes 
appropriately. 

14.4 Magnetic Oscillations in Bismuth 

Magnetic oscillations, de Haas-van Alphen and Shubnikov-de Haas oscillations, were discovered in 
bismuth (Bi), which forms a rhombohedral (rhl) crystal. A rhl crystal can be obtained by stretching 
the three body-diagonal distances from a simple cubic crystal as discussed in Sections 5.2 and 5.3. If 
an orthogonal unit cell with the Cartesian axes along the body-diagonal passing six corner atoms is 
chosen, then the system is periodic along the x-, y-, and z-axes passing the center. Thus, the system 
can be regarded as an ore, which has a 3D k-space. It is a challenge to obtain the effective masses 
(ni|, m 2 , m 3 ) inBi from the cyclotron resonance and other measured data. 
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Appendix 

A.l Second Quantization 

The most remarkable fact about a system of fermions is that no more than one fermion can occupy a 
quantum particle state (Pauli’s exclusion principle). For bosons no such restriction applies. That is, 
any number of bosons can occupy the same state. We shall discuss the second-quantization formalism 
in which creation and annihilation operators associated with each quantum state are used. This 
formalism is extremely useful in treating many-boson and/or many-fermion systems. Zero-mass 
bosons such as photons and phonons can be created or annihilated. These dynamical processes can 
only be described in second quantization. 

A.1.1 Boson Creation and Annihilation Operators 

Use of creation and annihilation operators (a’’', a) was demonstrated in Chapter 4, Section 4.3 for the 
treatment of a harmonic oscillator. Second quantization can be used for a general many-body system, 
as we see below. 

The quantum state for a system of bosons (or fermions) can most conveniently be represented by a 
set of occupation numbers {n' a |, where n' a are the numbers of bosons (or fermions) occupying the 

quantum particle states a. This representation is called the occupation-number representation or 
simply the number representation. For bosons, the possible values for n' a are zero, one, or any 

positive integers: 

( Al ) n> a = 0) 1 . 2, ... for bosons. 

The many-boson state can best be represented by the distribution of particles (balls) in the states 
(boxes) as shown in Figures A.l . where we choose the ID momentum states, pj = 27ihj/L, as 

examples. 

Figure A.l A many-boson state is represented by a set of boson numbers {m} occupying the state 

{pj}- 



* • ■ T n -2 = 0, ra_i = 0, no = 2, 

*1 = 0, 7X2 = It ’ 

Let us introduce operators iLj, whose eigenvalues are given by 0, 1,2, ... Since (Al) is meant for 
each and every state a independently, we assume that 
(A2) K. »t] - n a n b - n h n a = 0 . 















It is convenient to introduce complex dynamic variables r| and if' instead of directly dealing with 
the number operators n. We attach labels a, b, ... and assume that r\ and if satisfy the following Bose 
commutation rules: 



Let us set 

(A4) vivo = ■*. = »!. 


which is Hermitian. The eigenvalue equation is 
(A5) *K) = "V) - 


where n' is the eigenvalue of n. We showed earlier in Section 4.3 that the operator n has as 


eigenvalues all nonnegative integers: 

(A6) 0 , 1 , 2 ,... 


The corresponding normalized eigenkets are given by 
(A7) fr'r^VfV}- 

Let |(|) a ) be a normalized eigenket of r^ belonging to the eigenvalue 0 so that 
(A8) "‘I'M = nU*\<P*) = o ■ 

By multiplying all these kets |(J) a ) together, we construct a normalized eigenket: 

(A9) W-IWI*i)- > 

which is a simultaneous eigenket of all n belonging to the eigenvalue zero. This ket is called the 
vacuum ket. It has the following property: 
fAlOf 9*1*0) = 0 for any a . 

From (A7) we see that if n' l5 n' 2 , ... are any nonnegative integers, then 



is a normalized simultaneous eigenket of all n belonging to the eigenvalues n' 1? n' 2 , ... Various kets 
obtained by taking different n' form a complete set of kets all orthogonal to each other. 

Following Dirac [1, 2], we postulate that the quantum states for N bosons can be represented by a 
symmetric ket: 



where S is the symmetrizing operator 



and P's are permutation operators for the particle indices (1, 2, ..., N). The ket in 1A121 is not 
normalized but 

1A141 —= |{»M 

is a normalized ket representing the same state. Comparing (A 14! and Eq. (All) , we obtain 


(A15) — ■ 


That is, unnormalized symmetric kets |a a a b ... a g ) s for the system can be constructed by applying N 












creation operators r| a ^r| b ^ • • • r| g ^ to the vacuum ket |<D 0 ). So far we have tacitly assumed that the total 

number of bosons is fixed at N'. If this number is not fixed but is variable, we can easily extend the 
theory to this case. 

Let us introduce a Hermitian operator N defined by 

n = w viva = y n * = ^ > 

(A16) . . 

the summation extending over the whole set of boson states. Clearly, the operator N has eigenvalues 
0, 1, 2, ..., and the ket |a a a b ••• a g } s is an eigenket of N belonging to the eigenvalue N'. We may 


arrange kets in the order ofN', that is a zero-particle state, one-particle states, two-particle states, • • •: 
(A17) l*«) - - vUll&o) > 

These kets are all orthogonal to each other, two kets referring to the same number of bosons are 
orthogonal as before, and two referring to different numbers of bosons are orthogonal because they 
have different eigenvalues N'. By normalizing the kets, we obtain a set of kets like (AT41 with no 
restriction on (n'}. These kets form the basic kets in a representation where (n a [ are diagonal. 


A.1.2 Observables 


We wish to express observable physical quantities (observables) for the system of identical bosons in 
terms of r\ and rf. These observables are, by postulate, symmetric functions of the boson variables. 

An observable can be written in the form: 


am 


+ + = Y + Z + 


where y® is a function of the dynamic variables of the jth boson, z® that of the dynamic variables of 
the ith and jth bosons, and so on. 

We take 


(A19) 


Y = . 


Since y® acts only on the ket |a®j of the jth boson, we have 

,•»(!<) 


(A20^ 

The matrix element 


(A21) ( a * ] 


does not depend on the particle index j. Summing (A201 over all j and applying operator S to the 
result, we obtain 


(A22 , *y(|“5}|“S)-)h-4 ■ 

Since Y is symmetric, we can replace S Y by Y S for the lhs. After straightforward calculations, we 
obtain, from (A22) . 










Y *iU *1, ■ ■ ■ i■*«} = Y, £ ■ ■ ■ *4 -1 -I^K^I^) 

I fl 

= £ E ^ ■ ■ • ^j-i^j+i 

0 h j 

(A23) ■■■|*«)5te j {a.|r|ab). 

Using the commutation rules and the property (A101 we can show that (Problem A. 1) 
(A 24 )* W ”*”"' I<P ^ = ^- v * lV * 1 "' —W d i , s ■ 

Using this relation, we obtain from (A23 ) 

. Yvhvti-- = EEviVfc(a«|yN)(7iiVia---|*to)) ■ 

(A25) » t 

Since the kets hx/hx,' ••• I ^o) form a complete set, we obtain 

y = EEvIvi{a.|rl«i) ■ 
f A26I TV 

In a similar manner, Z in (A18) can be expressed by (Problem A.2) 

. z = £EEE vWb t U t Ic{a a ab\z\a c ad), 

(A27) abet 

where (a a a b |z|a c a d ) is given by 

(A28) l«c«4 = («L 1, |{a?| z (12) |o j 2) ) . 

Problem A.l. Prove (A2A} . Hint Start with cases of one- and two-particle state kets. 
Problem A.2. Prove (A27) by following steps similar to (A22WA26) . 


A.1.3 Fermion Creation and Annihilation Operators 

In this section we treat a system of identical fermions in a parallel manner. 

The quantum states for fermions, by postulate, are represented by antisymmetric kets: 

(A29} = A (K1 KJ-’K 1 )) f 

where 

1 


A = 


E^ 


(A30) p 

is the anti symmetrizing operator, with 6 P being +1 or -1, according to whether P is even or odd. Each 

antisymmetric ket in IA29I is characterized such that it changes its sign if an odd permutation of 
particle indices is applied to it, and the fermion states a, b, ..., g are all different. Just as for a boson 
system, we can introduce observables n,, n 2 , ..., each with eigenvalues 0 or 1 , representing the 

number of fermions in the states oq, a 2 , ..., respectively. The many-fermion occupation-number state 
can be represented as shown in Figures A.2 . 


Figure A.2 A many-fermion state is represented by the set of fermion numbers {nj} occupying the 
state pj = 27thj/L. Each nj is restricted to 0 or 1. 




















, n-2 — 1, fi-i = D. no = I, 
= L n ? = Q t 


We can also introduce a set of linear operators (q, qfy one pair (q a , q a ') for each state a a , satisfying 
the Fermi anticommutation rules: 

{*..*1} = + vU* = = o. 

The number of fermions in the state a a is again represented by 

(A3 2) = nln* = »1 ■ 

Using tA31k we obtain 

(A33) n l - *1(i - vlv*) v* = = n * or n l - n * = 0 ■ 

If an eigenket of n a belonging to the eigenvalue n a is denoted by |n a ), IA33I yields 

(A34) K" "*) K) = « ~ <) K) = 0 ■ 

Since the eigenket n a is nonzero, we obtain n a (n a - 1) = 0, meaning that the eigenvalues n a are either 
0 or 1 as required: 

(A35)< = <> *>r 1 . 

Similarly to the case for bosons, we can show that (Problem A.3) 

IA361 l a . a i-"°£)A = vlvl — vll&v) 
which is normalized to unity. 

Observables describing the system of fermions can be expressed in terms of operators r\ and q ' , and 
the results have the same form (A261 and (A27) as for the case of bosons. 

In summary, both states and observables for a system of identical particles can be expressed in 
terms of creation and annihilation operators. This formalism, called the second quantization, has some 
notable advantages over the usual Schrodinger formalism. First, the permutation-symmetry property 
of the quantum particle is represented simply in the form of Bose commutation (or Fermi 
anticommutation) rules. Second, observables in second quantization are defined for an arbitrary 
number of particles so that the formalism may apply to systems in which the number of particles is not 
fixed, but variable. Third, and most importantly, all relevant quantities (states and observables) can 
be defined referring only to the single-particle states. This property allows one to describe the motion 
of the many-body system in the 3D space. In fact, relativistic quantum field theory can be developed 
only in second quantization. Furthermore, boson creation and annihilation can only be described in 
second quantization. 

Problem A.3. Show that IA361 is normalized. 


A.1.4 Heisenberg Equation of Motion 

















In the Schrodinger picture (SP), the energy eigenvalue equation is 
(A37) = 

where n is the Hamiltonian and E the eigenvalue. In the position representation this equation is 
written as 


where 'P is the wavefunction for the system. We consider one-dimensional motion for conceptional 
and notational simplicity. (For three-dimensional motion, (x, p) should be replaced by (x, y, z, p x , p y , 

p^) = (r, p).) If the number of electrons N is large, the wavefimction ¥ contains many particle 
variables (x 1? x 2 , ...). This complexity needed in dealing with many particle variables can be avoided 
if we use second quantization and the Heisenberg picture (HP), which will be shown in this section. 

If the Hamiltonian ^ is the sum of single-particle Hamiltonians: 

(A3 9) j 

the Hamiltonian ^ can be represented by 


(A40) . k • h 

where q a (q a ^) are annihilation (creation) operators associated with particle state a and satisfying the 
Fermi anticommutation rules in f A31 ) or the Bose commutation rules in (A3). 

In the Heisenberg picture a dynamical variable ^(t) changes in time, following the Heisenberg 
equation of motion: 

m) 


(A41) 


= [%,%] = %U- 


Setting = r\J, we obtain 


(A42) 


[u,nl\, 


whose Hermitian conjugate is given by 


- ([«.rf]) t —IW.1J ■ 

By the quantum postulate the observables t, are Hermitian: ^ = t,. Variables r\ a and r\ a are not 
Hermitian, but both obey the same Heisenberg equation of motion, see (A42) and (A43). 

We introduce the Hamiltonian tA40) into (A42), and calculate the commutator [n, r\J]. In such a 
commutator calculation, the following identities are very useful: 

(A44) [A, BC\ = [A, £]C 4- B[A P C] , [AB, C] = A[ B, C} + [A, C]B , 

(A45) [A- B Q = {A, B}C - £{A, C) , [AB, C} = A{B,C}~ {A, C)B . 

Note that the negative signs on the right-hand terms in (A45I occur when the cyclic order is destroyed 
for the case of the anticommutaton (A, B} = AB + BA. We obtain from 1A421 and (A431 (Problem 
A.4) 


















or 


c b 

= £ £ h vt) = E h ' »*I 

« b c 


-»fE‘4 


IA46I 

We take the Hermitian conjugation and obtain 

“tt-!>■*» ■ 
rA47) * . 

Equation IA46) means that the change of the one-body operator r\J is determined by the one-body 

Hamiltonian h. This is one of the major advantages of working in the HP. Equations (A46) and (A47I 
are valid for any single-particle states {a}. 

In the field operator language (A47) reads 
3V* (M) 


iA 


= h V>(M) , 


(A48) dt 

which is formally identical to the Schrodinger equation of motion for a particle. 

If the system Hamiltonian ^ contains an interparticle interaction 

(A49) V= ^/ dJr 

where v(r — r’) denotes the interparticle interaction potential, the evolution equation for i|/(r, t) is 
nonlinear (Problem A.5): 

_ / ihd\ , , 

In quantum field theory the basic dynamical variables are particle field operators. The quantum 
statistics of the particles are given by the Bose commutation or the Fermi anticommutation rules 
satisfied by the field operators. The evolution equations of the field operators are intrinsically 
nonlinear when the interparticle interaction is present. 

Problem A.4. Verify that the equation of motion IA46) and (A47), holds for bosons. 

Problem A.5. Verify (A50) . Hint: use (A46) . 


A.2 Eigenvalue Problem and Equation-of- 
Motion Method 


In this Appendix we set up the energy eigenvalue problem for a quasiparticle and illustrate the 
equation-of-motion method to obtain the eigenvalue of a many-body Hamiltonian of quasiparticles. 
















A.2.1 Energy-Eigenvalue Problem in Second 
Quantization 


Here we consider as a preliminary an electron characterized by the Hamiltonian: 



We may set up the eigenvalue equations for the position x, the momentum p, and the one-body 
Hamiltonian h as follows: 

(A52)*K> = *VH 
(A53) p\p') = J>V) - 
A541 = e d £ d = £y\v), 

where x’, p', and are eigenvalues. 

By multiplying A54I from the left by (x|, we obtain 



(A56) $Ax) = {x\v ). 

Equation A55) is just the Schrodinger energy-eigenvalue equation, and cj) v (x) is the familiar quantum 

waveftmction. If we know with certainty that the system is in the energy eigenstate v, we can choose a 
density operator p! to be 

A57I Pi = \v){v\, {v\v) = 1 . 

This pj is a one-body density operator for the system h in a pure state |vj. 

Let us now consider trUv^xIpj}, which can be transformed as follows: 

tftlHPdPi} = = £{x|v){v|a){a|v) = {*|v) . 

a a 

From this we can write the waveftmction cf> v (x) as follows: 

A58I <Pv( x ) = - 

where the symbol “tr” denotes a one-body trace. It can be seen from (A57) that the waveftmction 
4 > v (x) can be regarded as a mixed representation of the density operator p 1 in terms of the states (v, x). 
In a parallel manner, we can show that the waveftmction in the momentum space (j) v (p) = (p|v) can be 
regarded as a mixed representation of pj in terms of energy state v and momentum state p (Problem 


A.1): 


(Aft91 «Mp} = tqivHfPipij = {p|pi|v) . 

In analogy with Aft 81 we introduce a quasiwavefimction v P v (p) through 
(A60! ^Tr{v7a P p} , 

where i|/ v ^ is the energy-state creation operator, ap the momentum-state annihilation operator, and p a 
many-body-system density operator that commutes with the Hamiltonian 
(A61) [U,p] = o. 

In Equation (A601 the symbol “Tr” denotes a many-body trace. Equation A61) is the necessary 















condition that p be a stationary density operator, which is seen at once from the quantum Liouville 
equation: 

(A62) iA 97 = M - 

Let us consider a system for which the total Hamiltonian ^ is the sum of single-electron energies h: 

n = Th«K 
(A63) j 

For example, the single-electron Hamiltonian h may contain the kinetic energy and the lattice potential 
energy. We assume that the Hamiltonian 'h does not depend on time explicitly. 

In second quantization the Hamiltonian ^ can be represented by 

(A64) <> b a b 

where H a (H a ^) are annihilation (creation) operators, satisfying the Fermi anticommutation rules (A3 I t . 

We calculate the commutator [ 7 ^, \p a ' ]. After straightforward calculation we obtain (Problem A.2) 

[U, yjJ] = Eyipl = Y, ■ 

(A65) T 

Multiplying Equation (A65I by a p p from the right and taking a many-body trace, we obtain 
T Wi>,v = £,W, 

(A 66 ) ? 

which is formally identical to the Schrodinger energy-eigenvalue equation for the one-body problem 
(Problem A.3): 

(A67) T 

The quasiwavefunction v P v (p) can be regarded as a mixed representation of the one-body density 
operator n in terms of the states (v, p) (Problem A.4): 

(A 68 ) = ■ 

The operator n is defined through 
(A69) = {afc|»|c£ fc > = n ka . 

These n^ are called b-a elements of the one-body density matrix. 

We reformulate tA 66 I for later use. Using IA 6 U and (A65), we can write f A 66 t as in the form 
(Problem A.5): 

(A70I Tr ([7^Kp} =TI {vt[* 9 ,K\p}=s y ¥yte) f 

where we used the invariance of cyclic permutation under the traced The complex conjugate of 
(A70) is similarly given by (Problem A. 6 ) 

fA71t = V^p} ■ 

Either (A70 ) or fA71t can be used to formulate the energy-eigenvalue problem If we choose the 
latter, we may proceed as follows: 

1. Given ^ in the momentum space, compute [ 7 ^, ap 1 "]; the result can be expressed as a linear 
function of a p ^; 

2. Multiply the result obtained in (1) by i|/ v p from the right, and take a trace; the result is a linear 

















function of \[/ ; 

3. Use (A71 ) in the resulting equation obtained in (2), and we obtain a linear homogeneous 
equation for i|/*, which is a standard form of the energy-eigenvalue equation. 

The energy-eigenvalue problem developed here is often called the equation-of-motion method. 
Problem A.l. Verify (A59). 

Problem A.2. Derive (A65). Hint: Use identities (A49). 

Problem A.3. Derive (A61) from (A54) . 

Problem A.4. Prove (A68). 

Problem A.5. Derive f A70) . 

Problem A.6. Verify (A71) . 

A.2.2 Energies of Quasielectrons (or “Electrons”) at 0 K 

We re-derive the energy gap equations, utilizing the equation-of-motion method [3]. Below the 
critical temperature T c , where the supercondensate is present, quasielectrons move differently from 

those above T c . Here we study the energies of quasielectrons at 0 K. This ground state is described in 
terms of the original reduced Hamiltonian in (7.45) . see below. 

- E +e 2 £ «c&«’-£'£' 

k k k 

(A72) + vrf'i,? + mbfhf] . 

By using this ^ red , we obtain (Problem A.7) 



These two equations indicate that the dynamics of quasielectrons described in terms of c’s are 
affected by stationary pairons described in terms ofb’s. 

Now let us find the energy of a quasielectron. We follow the equation-of-motion method. We 
multiply (A73) from the right by \\r w ^ p 0 , where \\f Y l) is the “electron” energy-state annihilation 
operator and 

75 P« = \r){*\ - O' (>*? + WOrT ( w 2 + WW 

is the density operator describing the supercondensate, and take a grand ensemble trace denoted by 
TR. After using (A7Q) . the lhs can be written as 

(A76) TR {[*-* £t] *?»} = TR ■ 

where we dropped the subscript v; the quasielectron is characterized by momentum p and energy 
E p (fy The first term on the rhs simply yields sp^V/^Cp)- Consider now 

(All) = TR {C‘-n^V'K , *'l} ■ 

The state pF) is normalized to unity, and it is the only system state at 0 K. Hence we obtain 
















t _/w.LWt-W JiUi 

1A78) TR r* c — p°} — (" I®* c —I**' - 

We assume here that k ^ p, since the state must change after a phonon exchange. We examine the 

relevant matrix element and obtain (Problem A. 8) 

/fii Am , jn l(u\ Ltut , „m fc d)t\.. ii» u iii 
(A79) x * + v * b * ) b * ( w * + v t b t } 1°) = u t v t ■ 

We can therefore write 

(A 8o ) TRjb‘'V 1 „^V} = »WNl"(-'’»’ 

(A81) *7 H ’ 1 “ TR • 

Collecting all contributions, we obtain from (A73) 




(i) (i) 


*-v C2S (2) 

+ ^E u t v t 


<Vp) 


c p vpi = * 

(A82) 

Using (7.621 and (7.63) . we obtain 
A vW 11 x u VAt* 21 ,,® 

/ A ^ = * * + 1Z ^ * * ■ 

(A83) k k 

We can therefore simplify (A821 to 

fA84! ^^(p) = 4 1 ) V , ^(p)-^iV , [ 1) Up) ■ 

Similarly we obtain from (A741 

r (l) i. <11/ x <!>,(!), l A .111*! j 

(A85) H») = ~4 i-p)~ A ^\ tp> - 

Energy can be interpreted as the positive energy required to create an up-spin unpairing 
electron at p in the presence of the supercondensate. The energy E_ p W can be regarded as the positive 

energy required to remove a down-spin electron from the paired state (pf, _ p|). These two energies 
are equal to each other 

(A86) £ r’ “ _ £ j” > 0 ■ 

In the stationary state (A841 and (A851 must hold simultaneously, thus yielding 


(A87) 


pl'l ,w 

t p -E f 


A-i 

ci) , „w 

p 


Ei" + £, 


= 0 , 


whose solutions are = ±(8p^^ 2 + A^) 172 . Since E^ 1 ) > 0, we obtain 


pd) _ Aim , A 2\ 1/2 

(A88) Ep ~V P + A V ■ 

The theory developed here can be applied to the “hole” in a parallel manner. We included this case in 
(A88). Our calculation confirms our earlier interpretation that E p ( b is the energy of the quasielectron. 

In summary, unpaired electrons are affected by the presence of the supercondensate, and their 
energies are given by tA88f 
Problem A.7. Derive (A73) and (A74). 

Problem A.8. Verify (A78) . 


A.3 Derivation of the Cooper Equation (7.34) 






















We derive the Cooper equation (7.341 in this Appendix. 

Let us consider a Cooper pair. Second-quantized operators for a pair of “electrons” (i.e., 
“electron” pairons) are defined by 

(A89) 2 = = = C 4 C 3 ■ 

Odd-numbered “electrons” carry up-spins | and even-numbered carry down-spins [. The 
commutators among B and can be computed using the commutators among B and B^ along with the 
Fermi anticommutation rules, and they are given by (Problem A. 1) 


[fin. bL] 


Cz4 
ci 4 


(A901 [^12p = 0 ; 

tA91I ®12 = B 12 = 0 ' 

1 — tii — n 2 if Jfci = jt 3 and Jt 2 = ifc+ 
if Jti = Jt 3 and Jt 2 ^ 
if fci ^ Jt 3 and Jf 2 = 

(A92I (0 otherwise f 

where 

(A93) ni ~ c *it Cti t* ni - c t2^ Ct 2+ 
are the number operators for electrons. 

Let us now introduce the relative and net (CM) momenta (k, q) such that 


* = -(* 1 * 2 ) , q = k t + k 2 \ 


l+ h 


k * = ~ k + 2 q - 


(A94) 

Alternatively we can represent pairon annihilation and creation operators by 

(A95) ^*9 _ B *lt*2l = t+fl/2|Ct+4f2t t = 4+s/2f C —*+4/2+ ■ 

The prime on B will be dropped hereafter. In the k-q representation the commutation relations are re¬ 
expressed as 

(A96) [**«• B *V] = 0 > 1W = 0 > 

[*r »*] " 

1 - n t +q /2 1 - n-t+qpi if k = k* and q= q' 


£A97J 


C—1+ s/2 4 ^ /2+ 


Cfc-h|/ 2 t<V +972t 


if k ^ = k + "y 
and — Jfc ^ — k f + 
if + 

and — k + £ = —i k r + ^ 
otherwise . 


If we drop the “hole” contribution from the generalized BCS Hamiltonian in Equation (723} , we 
obtain the Cooper Hamiltonian ^ c : 

Kc = ''oE E E Vll) - 

(A98) *>o ‘ * ? 

where the prime on the summation means the restriction: 

(A99) 0< f (l* + f I) ’ s (H + fl) < *°”>- 

The Hamiltonian can be expressed in terms of pair operators (B, B^): 











^=Er Kl‘ + ID Mb*+II)R*' 

k q 

-’^E'E'E'®!.®*'.- 

(A100) 7^7 

Using (A961 and (A97), we obtain (Problem A.2) 

hRHTRII) + T|-* + !l)R 

fAlOU 77 9 

If we represent the energies of pairons by w v and the associated pair annihilation operator by cj) v , n 
c can be expressed by 

(A102) v 

This equation is similar to (A451 in Appendix A.l with the only difference that here we deal with 
pair energies and pair-state operators. We multiply (A101) by (|>vPgc fr° m the right and take a grand 
ensemble trace: 


w v a kq - w q a tq = Tr B* J 0 vP B c} 

- b (I*+fl) +£ (b* + !!)]“** 

— *Y. (1 - f U+5/2t - n -l+q/u)i>v) . 

(A 1031 V 1 

where a kq is defined by 

(A104) atq,v ~ {^9^ >v ^ c } 55 ■ 

The energy w v can be characterized by q, and we have w v = w q . In other words, excited pairons have 
net momentum q and energy w q . We shall omit the subscripts v in the pairon wavefimction: a kq v = a kq . 
The angular brackets mean the grand canonical ensemble average of an observable a'. 


{A) - Tr{^} - 


Tit4exp[lRAf-7f)]} 


(A105) *"* Trexp tfW-U)} * 

where ^ and represent the Hamiltonian and the number operator, respectively. 

In the bulk limit: N — » 00, V —> 00 while n = N/V = finite, where N represents the number of 
electrons, and k-vectors become continuous. Denoting the wavefimction in this limit by a(k, q) and 
using a factorization approximation, we obtain from (AT 031 

w,a(*, = {e (|* 4- ||) + £ (|-* 4-1|)[ q) 




(A106) 


ftfk'Mk ',,) {l - [ £ (|t +1|)] - /„[ E (|-k +1|)]) . 


(A107) _ + 1 “ ’ 

where f F is the Fermi distribution function. The factorization is justified since the coupling between 
electrons and pairons is weak. 

In the low-temperature limit (T —» 0 or |3 —*■ 00), 

(A108) <£ p > 0). 

We then obtain 














fA109> (I* + ID +£ (l"‘ + f 1)1 (2 ?l ' 

This equation is identical to Cooper’s equation, Eq. (1) of his 1956 Physical Review [4]. For a 2D 
system, replacing d 3 k/(27ih) 3 in tAl 091 for the 3D case by d 2 k/(27ih) 2 we obtain the Cooper equation 
(7.341 for the 2D case. 

In the above derivation we obtained the Cooper equation in the zero-temperature limit. Hence, the 
energy of the pairon, w q , is temperature-independent. 

Problem A.l. Derive (A90), (A911 and (A921 . 

Problem A.2. Derive (A101) . 


A.4 Proof of (7.94) 


The number operator for the pairons in the state (k, q) is 
(All 0) *** ~ = C t+ S / 2 C -t+q/l^t+q/Z^k+q/i , 

where we omitted the spin indices. We write n kq 2 explicitly, transform the middle factors and obtain 


K kq — C k+q{2 C —k+qf2 C —t+q/2 C t+qJ2 C k+q/2 C '—k+q{ 2 C —t+ql 2 C t+q[l 


= C i+g/2 C — k+q/2 — C —fc+ q/2 C_ ') 

^^1 J x (l — k+q/2^k+q/2 = n lq . 

where we used (A361 (the Fermi commutation rules), and (A381 in Appendix A. 1. Hence, we obtain 
(A112) ~ nt *) ^ = ~ n>t *) = 0 ■ 


Since |n' kq ) ^ 0, we obtain 

(All3) n tq = 0 or !■ 
We now introduce 


Hg = ^ ^ Stj , 

(A1141 k 
and obtain (Problem A.l) 

(All51 = ^^ — ~ = ^^1] = ■ 

Although the occupation number is not connected with B q as riq ^ B q t B q , the eigenvalues n' q of 
satisfying (A1141 can be shown straightforwardly to yield 


(All 6 ) < = 0 , 1 , 2 ,- , 
with the eigenstates 

(All7) 1°) > lh = ^| 0 ), | 2 »-b}b}w,-. 

The derivation of the boson occupation numbers n' = 0, 1,2, • ■ ■, from [q, n] = q, n = fy q, follows the 
steps after (A101 . and ending with (A171 in Appendix A.l. We may follow the same steps. By setting 
q = 0 in this equation, we obtain (7.851 . 

Problem A.l. Prove (A1151 . 




















A.5 Statistical Weight for the Landau States 

The statistical weight yv for the Landau states in 3D and 2D are calculated in this appendix. 


A.5.1 The Three-Dimensional Case 

Poisson’s sum formula [5, 6] is 

t, E E jwxr'”'. 

(A118i "=-°° 

where F is the Fourier transform of f, and the sum Xn=-oo G ° f(27in + t), 0 < t < 2n, is periodic with the 
period 1. The sum is by assumption uniformly convergent. 

We write the sum in (9.47^ as 

2 Y' t/ c — (2 « -h 1 )ji = (f — n) V2 -I- <P(e; 0), 

FA 1194 

oo 

4>{e\x)= Y (f — n — 2n\n 4-x|) 1/2 . 

(A120) 

Note that (j)(e; x) is periodic in x with the period 1, and it can, therefore, be expanded in a Fourier 
series. After the Fourier series expansion, we set x = 0 and obtain (All9) . By taking the real part (sr) 
of (All 9) and using (A118i and (9.47) . we obtain 




W(E) 


o m — i 

£ 

x / dt,£ - I)V1 ' 


' cos m t , 

(A12D 
where we assumed 
_ 2nE 

(A122) £ "a^7 

and neglected n against 8. The integral in the first term in (A1211 yields (2/3)e 3/2 , leading to w 0 i n 

(9.51L The integral in the second term can be written after integrating by parts, changing the variable 
(ms - mr = t), and using sin(A — B) = sin A cos B - cos A sin B as 


( ms m- e \ 

f . cost / . sint \ 

smms I dt———cosms / dt—— 

i * l 

We use asymptotic expansion for me = x » 1: 


(A124) i 


* . ,— 

/ suit hi cosx 

d, 7r~V7- vr 

0 

* f — 

/ cost hi smx 


(A125) i 

The second terms in the expansions lead to yv L in (9.52L where we used sin 2 A + cos 2 A = 1 and 


















m-1 
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y (~1) 

(A126) k m 
The first terms lead to the oscillatory term w osc in (9.53 ) . 


A.5.2 The Two-Dimensional Case 

We write the sum in (10.35) as 

oo 

0 ( 2n + h 31 != <9(*-*) + v j fr;°)> 

(A127) n=Q 

{X> 

w (£; x J = T 0{e — 7i - 2.xI h + x \) , 

(A128) 

Note that \\t (e; x) is periodic in x and can therefore be expanded in a Fourier series. After the Fourier 
expansion, we set x = 0 and obtain (A127) . By taking the real part (sr) of (A127) and using (All8k 
we obtain 


i COS VT , 


0t{(A127)} = - / dr@(f-r) + 2y(_i) 1 ' f drG{s-r)i 
(A129") * t n v=i { 

where we assumed s = 27iE/hoj c » 1 and neglected n against s. The integral in the first term in (A129) 

yields 8. The integral in the second term is 


l 

cos vr = — sin ve . 


I dr@(f — r) 

(A130) o 
Thus, we obtain 

1 2 °° (— IV 

9t{{A127H = — s H— T -—— sin vs 
(A131) 31 v 

Using (10.35) and (A 131) . we obtain 

W[T) = W 0 + W 0BC 


™ \ , 2 ~ (—ir . (2xvE\ 

= C|A ai c (— ) + Cfett> c — > -sin —- , 

\ n / ji ^ v V / 


(A132) 

which establishes (10.38)-(10.40) 


A.6 Derivation of Formulas (11.16)-(11.18) 

Let us start with a BCS-like Hamiltonian (11.8) . Dropping the “holes” from the Hamiltonian we 
obtain a Hamiltonian associated with “electrons” as 

Jc k i? Ir j/ 

= + f M+4|) Y, Y, ’ 

(A133) *9 9 * A J 

where = c un c u> i s the number operator for the “electron” (1) (fluxon (3)), e™ = cUjf^ 2 ciy + 2 is the 

pair operator for the “electron” and we suppressed the “electron.” Note that the prime on the 

summation in (A133) means the restriction: 0 < e k < frco D , co D = Debye frequency. Using the 






















anticommutation rules (11.9) we obtain 

(i-»,+*-»!?, +} ) • 

(A134) * 

The Hamiltonian is bilinear in (B, B’f), and can therefore be diagonalized: = X p w^^ly 

where is the energy and cj) p the annihilation operator. We multiply (A1 341 by (|) p from the right, 
take a grand canonical ensemble average, 
denoted by angular brackets, and get 


+ £ 


( 3 ) 

-k+i | 




^0 


WpV/tfaq) - («n+f| 1 “l-H-JI/ (2 jt h) 1 

( A135 ) */ j 1 - /f ( f l-f+4l) - /f (^- + j|)J ■ 


where {iy = f F (s p ) is the Fermi distribution function. The reduced wavefimction T^k, q) = {B kq ty p } = 
(|x|n|k, q) can be regarded as the mixed representation of the reduced density operator ii defined 
through (k% q’|n|k, q> = <B kq + B kkq y The fc-boson energy co p can be specified by (N L , q), and it will 
be denoted by co 0 since it is N L -independent. As T —► 0, f F (s p ) —► 0. Dropping the fluxon energy and 
replacing q/2 by q, we obtain (11.16) . We solve this equation, assuming s F » hco D . Using a Taylor 
series expansion, we obtain (11.17) to the linear in q. 

Now we derive (11.18) . The BEC occurs when the chemical potential p vanishes at a finite T. The 
critical temperature T c can be determined from 


(A136) " = {2jlh) ~ 2 f A2p[ ^ ~ 11_13 = (kBTcrl ■ 

After expanding the integrand in powers of e - P cs and using s 

ft = 1.654(2 ji 1 -1 | *' Bi ' ) , 

(A137) ■ ' \*c) 

yielding a general formula (11.18) for 2D BEC. 


cp, we obtain 
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